nLab geometry of physics -- perturbative quantum field theory

Redirected from "A first idea of quantum field theory".

Contents

These notes (pdf, 323 pages) mean to give an expository but rigorous introduction to the basic concepts of relativistic perturbative quantum field theories, specifically those that arise as the perturbative quantization of Lagrangian field theories – such as quantum electrodynamics, quantum chromodynamics, and perturbative quantum gravity appearing in the standard model of particle physics.

$\,$

This is one chapter of geometry of physics.

Previous chapters: smooth sets, supergeometry.

$\,$

Geometry
Spacetime
Fields
Field variations
Lagrangians
Symmetries
Observables
Phase space
Propagators
Gauge symmetries
Reduced phase space
Gauge fixing
Quantization
Free quantum fields
Interacting quantum fields
Renormalization

$\,$

For broad introduction of the idea of the topic of perturbative quantum field theory see there and see

PhysicsForums-Insights: Introduction to Perturbative Quantum Field Theory

Here, first we consider classical field theory (or rather pre-quantum field theory), complete with BV-BRST formalism; then its deformation quantization via causal perturbation theory to perturbative quantum field theory. This mathematically rigorous (i.e. clear and precise) formulation of the traditional informal lore has come to be known as perturbative algebraic quantum field theory.

We aim to give a fully local discussion, where all structures arise on the “jet bundle over the field bundle” (introduced below) and “transgress” from there to the spaces of field histories over spacetime (discussed further below). This “Higher Prequantum Geometry” streamlines traditional constructions and serves the conceptualization in the theory. This is joint work with Igor Khavkine.

In full beauty these concepts are extremely general and powerful; but the aim here is to give a first precise idea of the subject, not a fully general account. Therefore we concentrate on the special case where spacetime is Minkowski spacetime (def. below), where the field bundle (def. below) is an ordinary trivial vector bundle (example below) and hence the Lagrangian density (def. below) is globally defined. Similarly, when considering gauge theory we consider just the special case that the gauge parameter-bundle is a trivial vector bundle and we concentrate on the case that the gauge symmetries are “closed irreducible” (def. below). But we aim to organize all concepts such that the structure of their generalization to curved spacetime and non-trivial field bundles is immediate.

This comparatively simple setup already subsumes what is considered in traditional texts on the subject; it captures the established perturbative BRST-BV quantization of gauge fields coupled to fermions on curved spacetimes – which is the state of the art. Further generalization, necessary for the discussion of global topological effects, such as instanton configurations of gauge fields, will be discussed elsewhere (see at homotopical algebraic quantum field theory).

Alongside the theory we develop the concrete examples of the real scalar field, the electromagnetic field and the Dirac field; eventually combining these to a disussion of quantum electrodynamics.

running examples

field	field bundle	Lagrangian density	equation of motion
real scalar field	expl.	expl.	expl.
Dirac field	expl.	expl.	expl.
electromagnetic field	expl.	expl.	expl.
Yang-Mills field	expl. , expl.	expl.	expl.
B-field	expl.	expl	expl.

$\,$

field	Poisson bracket	causal propagator	Wightman propagator	Feynman propagator
real scalar field	expl. , expl.	prop.	def.	def.
Dirac field	expl. , expl.	prop.	def.	def.
electromagnetic field		prop.		prop.

$\,$

field	gauge symmetry	local BRST complex	gauge fixing
electromagnetic field	expl.	expl.	expl.
Yang-Mills field	expl.	expl.	…
B-field	expl.	expl.	…

$\,$

interacting field theory	interaction Lagrangian density	interaction Wick algebra-element
phi^n theory	exp.	expl.
quantum electrodynamics	expl.	expl.

$\,$

References

Pointers to the literature are given in each chapter, alongside the text. The following is a selection of these references.

The discussion of spinors in chapter 2. Spacetime follows Baez-Huerta 09.

The functorial geometry of supergeometric spaces of field histories in 3. Fields follows Schreiber 13, further developed by Giotopoulos & Sati 2023.

For the jet bundle-formulation of variational calculus of Lagrangian field theory in 4. Field variations, and 5. Lagrangians we follow Anderson 89 and Olver 86, further developed by Giotopoulos & Sati 2023; for 6. Symmetries augmented by Fiorenza-Rogers-Schreiber 13b.

The identification of polynomial observables with distributions in 7. Observables was observed by Paugam 12.

The discussion of the Peierls-Poisson bracket in 8. Phase space is based on Khavkine 14.

The derivation of wave front sets of propagators in 9. Propagators takes clues from Radzikowski 96 and uses results from Gelfand-Shilov 66.

For the general idea of BV-BRST formalism a good review is Henneaux 90.

The Lie algebroid-perspective on BRST complexes developed in chapter 10. Gauge symmetries, may be compared to Barnich 10.

For the local BV-BRST theory laid out in chapter 11. Reduced phase space we are following Barnich-Brandt-Henneaux 00.

For the BV-gauge fixing developed in 12. Gauge fixing we take clues from Fredenhagen-Rejzner 11a.

For the free quantum BV-operators in 13. Free quantum fields and the interacting quantum master equation in 15. Interacting quantum fields we are following Fredenhagen-Rejzner 11b, Rejzner 11, which in turn is taking clues from Hollands 07.

The discussion of quantization in 13. Quantization takes clues from Hawkins 04, Collini 16 and spells out the derivation of the Moyal star product from geometric quantization of symplectic groupoids due to Gracia-Bondia & Varilly 94.

The perspective on the Wick algebra in 14. Free quantum fields goes back to Dito 90 and was revived for pAQFT in Dütsch-Fredenhagen 00. The proof of the folklore result that the perturbative Hadamard vacuum state on the Wick algebra is indeed a state is cited from Dütsch 18.

The discussion of causal perturbation theory in 15. Interacting quantum fields follows the original Epstein-Glaser 73. The relevance here of the star product induced by the Feynman propagator was highlighted in Fredenhagen-Rejzner 12. The proof that the interacting field algebra of observables defined by Bogoliubov's formula is a causally local net in the sense of the Haag-Kastler axioms is that of Brunetti-Fredenhagen 00.

Our derivation of Feynman diagrammatics follows Keller 10, chapter IV, our derivation of the quantum master equation follows Rejzner 11, section 5.1.3, and our discussion of Ward identities is informed by Dütsch 18, chapter 4.

In chapter 16. Renormalization we take from Brunetti-Fredenhagen 00 the perspective of Epstein-Glaser renormalization via extension of distributions and from Brunetti-Dütsch-Fredenhagen 09 and Dütsch 10 the rigorous formulation of Gell-Mann Low renormalization group flow, UV-regularization, effective quantum field theory and Polchinski's flow equation.

$\,$

Acknowledgement

These notes profited greatly from discussions with Igor Khavkine and Michael Dütsch.

Thanks also to Marco Benini, Klaus Fredenhagen, Arnold Neumaier and Kasia Rejzner for helpful discussion.

$\,$

Geometry

The geometry of physics is differential geometry. This is the flavor of geometry which is modeled on Cartesian spaces $\mathbb{R}^n$ with smooth functions between them. Here we briefly review the basics of differential geometry on Cartesian spaces.

In principle the only background assumed of the reader here is

usual naive set theory (e.g. Lawvere-Rosebrugh 03);
the concept of the continuum: the real line $\mathbb{R}$ , the plane $\mathbb{R}^2$ , etc.
the concepts of differentiation and integration of functions on such Cartesian spaces;

hence essentially the content of multi-variable differential calculus.

We now discuss:

Abstract coordinate systems
Fiber bundles
Synthetic differential geometry
Differential forms

As we uncover Lagrangian field theory further below, we discover ever more general concepts of “space” in differential geometry, such as smooth manifolds, diffeological spaces, infinitesimal neighbourhoods, supermanifolds, Lie algebroids and super Lie ∞-algebroids. We introduce these incrementally as we go along:

more general spaces in differential geometry introduced further below

											higher differential geometry
differential geometry	smooth manifolds (def. )	$\hookrightarrow$	diffeological spaces (def. )	$\hookrightarrow$	smooth sets (def. )	$\hookrightarrow$	formal smooth sets (def. )	$\hookrightarrow$	super formal smooth sets (def. )	$\hookrightarrow$	super formal smooth ∞-groupoids (not needed in fully perturbative QFT)
infinitesimal geometry, Lie theory							infinitesimally thickened points (def. )		superpoints (def. )		Lie ∞-algebroids (def. )
											higher Lie theory
needed in QFT for:	spacetime (def. )		space of field histories (def. )				Cauchy surface (def. ), perturbation theory (def. )		Dirac field (expl. ), Pauli exclusion principle		infinitesimal gauge symmetry/BRST complex (expl. )

$\,$

Abstract coordinate systems

What characterizes differential geometry is that it models geometry on the continuum, namely the real line $\mathbb{R}$ , together with its Cartesian products $\mathbb{R}^n$ , regarded with its canonical smooth structure (def. below). We may think of these Cartesian spaces $\mathbb{R}^n$ as the “abstract coordinate systems” and of the smooth functions between them as the “abstract coordinate transformations”.

We will eventually consider below much more general “smooth spaces” $X$ than just the Cartesian spaces $\mathbb{R}^n$ ; but all of them are going to be understood by “laying out abstract coordinate systems” inside them, in the general sense of having smooth functions $f \colon \mathbb{R}^n \to X$ mapping a Cartesian space smoothly into them. All structure on generalized smooth spaces $X$ is thereby reduced to compatible systems of structures on just Cartesian spaces, one for each smooth “probe” $f\colon \mathbb{R}^n \to X$ . This is called “functorial geometry”.

Notice that the popular concept of a smooth manifold (def./prop. below) is essentially that of a smooth space which locally looks just like a Cartesian space, in that there exist sufficiently many $f \colon \mathbb{R}^n \to X$ which are (open) isomorphisms onto their images. Historically it was a long process to arrive at the insight that it is wrong to fix such local coordinate identifications $f$ , or to have any structure depend on such a choice. But it is useful to go one step further:

In functorial geometry we do not even focus attention on those $f \colon \mathbb{R}^n \to X$ that are isomorphisms onto their image, but consider all “probes” of $X$ by “abstract coordinate systems”. This makes differential geometry both simpler as well as more powerful. The analogous insight for algebraic geometry is due to Grothendieck 65; it was transported to differential geometry by Lawvere 67.

This allows to combine the best of two superficially disjoint worlds: On the one hand we may reduce all constructions and computations to coordinates, the way traditionally done in the physics literature; on the other hand we have full conceptual control over the coordinate-free generalized spaces analyzed thereby. What makes this work is that all coordinate-constructions are functorially considered over all abstract coordinate systems.

$\,$

Definition

(Cartesian spaces and smooth functions between them)

For $n \in \mathbb{N}$ we say that the set $\mathbb{R}^n$ of n-tuples of real numbers is a Cartesian space. This comes with the canonical coordinate functions

x^k \;\colon\; \mathbb{R}^n \longrightarrow \mathbb{R}

which send an n-tuple of real numbers to the $k$ th element in the tuple, for $k \in \{1, \cdots, n\}$ .

For

f \;\colon\; \mathbb{R}^{n} \longrightarrow \mathbb{R}^{n'}

any function between Cartesian spaces, we may ask whether its partial derivative along the $k$ th coordinate exists, denoted

\frac{\partial f}{\partial x^k} \;\colon\; \mathbb{R}^{n} \longrightarrow \mathbb{R}^{n'} \,.

If this exists, we may in turn ask that the partial derivative of the partial derivative exists

\frac{\partial^2 f}{\partial x^{k_1} \partial x^{k_2}} \coloneqq \frac{\partial}{\partial x^{k_2}} \frac{\partial f}{\partial x^{k_1}}

and so on.

A general higher partial derivative obtained this way is, if it exists, indexed by an n-tuple of natural numbers $\alpha \in \mathbb{N}^n$ and denoted

(1)

\partial_\alpha \;\coloneqq\; \frac{ \partial^{\vert \alpha \vert} f }{ \partial^{\alpha_1} x^1 \partial^{\alpha_2} x^2 \cdots \partial^{\alpha_n} x^n } \,,

where ${\vert \alpha\vert} \coloneqq \underoverset{n}{i = 1}{\sum} \alpha_i$ is the total order of the partial derivative.

If all partial derivative to all orders $\alpha \in \mathbb{N}^n$ of a function $f \colon \mathbb{R}^n \to \mathbb{R}^{n'}$ exist, then $f$ is called a smooth function.

Of course the composition $g \circ f$ of two smooth functions is again a smooth function.

\array{ && \mathbb{R}^{n_2} \\ & {}^{\mathllap{f}}\nearrow && \searrow^{\mathrlap{g}} \\ \mathbb{R}^{n_1} && \underset{g \circ f}{\longrightarrow} && \mathbb{R}^{n_3} } \,.

The inclined reader may notice that this means that Cartesian spaces with smooth functions between them constitute a category (“CartSp”); but the reader not so inclined may ignore this.

For the following it is useful to think of each Cartesian space as an abstract coordinate system. We will be dealing with various generalized smooth spaces (see the table below), but they will all be characterized by a prescription for how to smoothly map abstract coordinate systems into them.

Example

(coordinate functions are smooth functions)

Given a Cartesian space $\mathbb{R}^n$ , then all its coordinate functions (def. )

x^k \;\colon\; \mathbb{R}^n \longrightarrow \mathbb{R}

are smooth functions (def. ).

For

f \colon \mathbb{R}^{n_1} \longrightarrow \mathbb{R}^{n_2}

any smooth function and $a \in \{1, 2, \cdots, n_2\}$ write

f^a \coloneqq x^a \circ f \;\colon\; \mathbb{R}^{n_1} \overset{f}{\longrightarrow} \mathbb{R}^{n_2} \overset{x^a}{\longrightarrow} \mathbb{R}

. for its composition with this coordinate function.

Example

(algebra of smooth functions on Cartesian spaces)

For each $n \in \mathbb{N}$ , the set

C^\infty(\mathbb{R}^n) \;\coloneqq\; Hom_{CartSp}(\mathbb{R}^n, \mathbb{R})

of real number-valued smooth functions $f \colon \mathbb{R}^n \to \mathbb{R}$ on the $n$ -dimensional Cartesian space (def. ) becomes a commutative associative algebra over the ring of real numbers by pointwise addition and multiplication in $\mathbb{R}$ : for $f,g \in C^\infty(\mathbb{R}^n)$ and $x \in \mathbb{R}^n$

$(f + g)(x) \coloneqq f(x) + g(x)$
$(f \cdot g)(x) \coloneqq f(x) \cdot g(x)$ .

The inclusion

\mathbb{R} \overset{const}{\hookrightarrow} C^\infty(\mathbb{R}^n)

is given by the constant functions.

We call this the real algebra of smooth functions on $\mathbb{R}^n$ :

C^\infty(\mathbb{R}^n) \;\in\; \mathbb{R} Alg \,.

f \;\colon\; \mathbb{R}^{n_1} \longrightarrow \mathbb{R}^{n_2}

is any smooth function (def. ) then pre-composition with $f$ (“pullback of functions”)

\array{ C^\infty(\mathbb{R}^{n_2}) &\overset{f^\ast}{\longrightarrow}& C^\infty(\mathbb{R}^{n_1}) \\ g &\mapsto& f^\ast g \coloneqq g \circ f }

is an algebra homomorphism. Moreover, this is clearly compatible with composition in that

f_1^\ast(f_2^\ast g) = (f_2 \circ f_1)^\ast g \,.

Stated more abstractly, this means that assigning algebras of smooth functions is a functor

C^\infty(-) \;\colon\; CartSp \longrightarrow \mathbb{R} Alg^{op}

from the category CartSp of Cartesian spaces and smooth functions between them (def. ), to the opposite of the category $\mathbb{R}$ Alg of $\mathbb{R}$ -algebras.

Definition

(local diffeomorphisms and open embeddings of Cartesian spaces)

A smooth function $f \colon \mathbb{R}^{n} \to \mathbb{R}^{n}$ from one Cartesian space to itself (def. ) is called a local diffeomorphism, denoted

f \;\colon\; \mathbb{R}^{n} \overset{et}{\longrightarrow} \mathbb{R}^n

if the determinant of the matrix of partial derivatives (the “Jacobian” of $f$ ) is everywhere non-vanishing

det \left( \array{ \frac{\partial f^1}{\partial x^1}(x) &\cdots& \frac{\partial f^n}{\partial x^1}(x) \\ \vdots && \vdots \\ \frac{\partial f^1}{\partial x^n}(x) &\cdots& \frac{\partial f^n}{\partial x^n}(x) } \right) \;\neq\; 0 \phantom{AAAA} \text{for all} \, x \in \mathbb{R}^n \,.

If the function $f$ is both a local diffeomorphism, as above, as well as an injective function then we call it an open embedding, denoted

f \;\colon\; \mathbb{R}^n \overset{\phantom{A}et\phantom{A}}{\hookrightarrow} \mathbb{R}^n \,.

Definition

(good open cover of Cartesian spaces)

For $\mathbb{R}^n$ a Cartesian space (def. ), a differentiably good open cover is

an indexed set

$\left\{ \mathbb{R}^n \underoverset{et}{\phantom{AA}f_i\phantom{AA}}{\hookrightarrow} \mathbb{R}^n \right\}_{i \in I}$

of open embeddings (def. )

such that the images

U_i \coloneqq im(f_i) \subset \mathbb{R}^n

satisfy:

(open cover) every point of $\mathbb{R}^n$ is contained in at least one of the $U_i$ ;
(good) all finite intersections $U_{i_1} \cap \cdots \cap U_{i_k} \subset \mathbb{R}^n$ are either empty set or themselves images of open embeddings according to def. .

The inclined reader may notice that the concept of differentiably good open covers from def. is a coverage on the category CartSp of Cartesian spaces with smooth functions between them, making it a site, but the reader not so inclined may ignore this.

(Fiorenza-Schreiber-Stasheff 12, def. 6.3.9)

$\,$

fiber bundles

Given any context of objects and morphisms between them, such as the Cartesian spaces and smooth functions from def. it is of interest to fix one object $X$ and consider other objects parameterized over it. These are called bundles (def. ) below. For reference, we briefly discuss here the basic concepts related to bundles in the context of Cartesian spaces.

Of course the theory of bundles is mostly trivial over Cartesian spaces; it gains its main interest from its generalization to more general smooth manifolds (def./prop. below). It is still worthwhile for our development to first consider the relevant concepts in this simple case first.

For more exposition see at fiber bundles in physics.

$\,$

Definition

(bundles)

We say that a smooth function $E \overset{fb}{\to} X$ (def. ) is a bundle just to amplify that we think of it as exhibiting $E$ as being a “space over $X$ ”:

\array{ E \\ \downarrow\mathrlap{fb} \\ X } \,.

For $x \in X$ a point, we say that the fiber of this bundle over $x$ is the pre-image

(2)

E_x \coloneqq fb^{-1}(\{x\}) \subset E

of the point $x$ under the smooth function. We think of $fb$ as exhibiting a “smoothly varying” set of fiber spaces over $X$ .

Given two bundles $E_1 \overset{fb_1}{\to} X$ and $E_2 \overset{fb_2}{\to} X$ over $X$ , a homomorphism of bundles between them is a smooth function $f \colon E_1 \to E_2$ (def. ) between their total spaces which respects the bundle projections, in that

fb_2 \circ f = fb_1 \phantom{AAAA} \text{i.e.} \phantom{AAA} \array{ E_1 && \overset{f}{\longrightarrow} && E_2 \\ & {}_{\mathllap{fb_1}}\searrow && \swarrow_{\mathrlap{fb_2}} \\ && X } \,.

Hence a bundle homomorphism is a smooth function that sends fibers to fibers over the same point:

f\left( (E_1)_x \right) \;\subset\; (E_2)_x \,.

The inclined reader may notice that this defines a category of bundles over $X$ , which is in fact just the slice category $CartSp_{/X}$ ; the reader not so inclined may ignore this.

Definition

(sections)

Given a bundle $E \overset{fb}{\to} X$ (def. ) a section is a smooth function $s \colon X \to E$ such that

fb \circ s = id_X \phantom{AAAAA} \array{ && E \\ & {}^{\mathllap{s}}\nearrow & \downarrow\mathrlap{fb} \\ X &=& X } \,.

This means that $s$ sends every point $x \in X$ to an element in the fiber over that point

s(x) \in E_x \,.

We write

\Gamma_X(E) \coloneqq \left\{ \array{ && E \\ & {}^{\mathllap{s}}\nearrow & \downarrow^\mathrlap{fb} \\ X &=& X } \phantom{fb} \right\}

for the set of sections of a bundle.

For $E_1 \overset{f_1}{\to} X$ and $E_2 \overset{f_2}{\to} X$ two bundles and for

\array{ E_1 && \overset{f}{\longrightarrow} && E_2 \\ & {}_{\mathllap{fb_1}}\searrow && \swarrow_{\mathrlap{fb_2}} \\ && X }

a bundle homomorphism between them (def. ), then composition with $f$ sends sections to sections and hence yields a function denoted

\array{ \Gamma_X(E_1) &\overset{f_\ast}{\longrightarrow}& \Gamma_X(E_2) \\ s &\mapsto& f \circ s } \,.

Example

(trivial bundle)

For $X$ and $F$ Cartesian spaces, then the Cartesian product $X \times F$ equipped with the projection

\array{ X \times F \\ \downarrow^\mathrlap{pr_1} \\ X }

to $X$ is a bundle (def. ), called the trivial bundle with fiber $F$ . This represents the constant smoothly varying set of fibers, constant on $F$

If $F = \ast$ is the point, then this is the identity bundle

\array{ X \\ \downarrow\mathrlap{id} \\ X } \,.

Given any bundle $E \overset{fb}{\to} X$ , then a bundle homomorphism (def. ) from the identity bundle to $E \overset{fb}{\to} X$ is equivalently a section of $E \overset{fb}{\to} X$ (def. )

\array{ X && \overset{s}{\longrightarrow} && E \\ & {}_{\mathllap{id}}\searrow && \swarrow_{\mathrlap{fb}} \\ && X }

Definition

(fiber bundle)

A bundle $E \overset{fb}{\to} X$ (def. ) is called a fiber bundle with typical fiber $F$ if there exists a differentiably good open cover $\{U_i \hookrightarrow X\}_{i \in I}$ (def. ) such that the restriction of $fb$ to each $U_i$ is isomorphic to the trivial fiber bundle with fiber $F$ over $U_i$ . Such diffeomorphisms $f_i \colon U_i \times F \overset{\simeq}{\to} E\vert_{U_i}$ are called local trivializations of the fiber bundle:

\array{ U_i \times F &\underoverset{\simeq}{f_i}{\longrightarrow}& E\vert_{U_i} \\ & {}_{\mathllap{pr_1}}\searrow & \downarrow\mathrlap{fb\vert_{U_i}} \\ && U_i } \,.

Definition

(vector bundle)

A vector bundle is a fiber bundle $E \overset{vb}{\to} X$ (def. ) with typical fiber a vector space $V$ such that there exists a local trivialization $\{U_i \times V \underoverset{\simeq}{f_i}{\to} E\vert_{U_i}\}_{i \in I}$ whose gluing functions

U_i \cap U_j \times V \overset{f_i\vert_{U_i \cap U_j}}{\longrightarrow} E\vert_{U_i \cap U_j} \overset{f_j^{-1}\vert_{U_i \cap U_j}}{\longrightarrow} U_i \cap U_j \times V

for all $i,j \in I$ are linear functions over each point $x \in U_i \cap U_j$ .

A homomorphism of vector bundle is a bundle morphism $f$ (def. ) such that there exist local trivializations on both sides with respect to which $g$ is fiber-wise a linear map.

The inclined reader may notice that this makes vector bundles over $X$ a category (denoted $Vect_{/X}$ ); the reader not so inclined may ignore this.

Example

(module of sections of a vector bundle)

Given a vector bundle $E \overset{vb}{\to} X$ (def. ), then its set of sections $\Gamma_X(E)$ (def. ) becomes a real vector space by fiber-wise multiplication with real numbers. Moreover, it becomes a module over the algebra of smooth functions $C^\infty(X)$ (example ) by the same fiber-wise multiplication:

\array{ C^\infty(X) \otimes_{\mathbb{R}} \Gamma_X(E) &\longrightarrow& \Gamma_X(E) \\ (f,s) &\mapsto& (x \mapsto f(x) \cdot s(x)) } \,.

For $E_1 \overset{fb_1}{\to} X$ and $E_2 \overset{fb_2}{\to} X$ two vector bundles and

\array{ E_1 && \overset{f}{\longrightarrow} && E_2 \\ & {}_{\mathllap{fb_1}}\searrow && \swarrow_{\mathrlap{fb_2}} \\ && X }

a vector bundle homomorphism (def. ) then the induced function on sections (def. )

f_\ast \;\colon\; \Gamma_X(E_1) \longrightarrow \Gamma_X(E_2)

is compatible with this action by smooth functions and hence constitutes a homomorphism of $C^\infty(X)$ -modules.

The inclined reader may notice that this means that taking spaces of sections yields a functor

\Gamma_X(-) \;\colon\; Vect_{/X} \longrightarrow C^\infty(X) Mod

from the category of vector bundles over $X$ to that over modules over $C^\infty(X)$ .

Example

(tangent vector fields and tangent bundle)

For $\mathbb{R}^n$ a Cartesian space (def. ) the trivial vector bundle (example , def. )

\array{ T \mathbb{R}^n &\coloneqq& \mathbb{R}^n \times \mathbb{R}^n \\ \mathllap{tb}\downarrow && \downarrow\mathrlap{pr_1} \\ \mathbb{R}^n &=& \mathbb{R}^n }

is called the tangent bundle of $\mathbb{R}^n$ . With $(x^a)_{a = 1}^n$ the coordinate functions on $\mathbb{R}^n$ (def. ) we write $(\partial_a)_{a = 1}^n$ for the corresponding linear basis of $\mathbb{R}^n$ regarded as a vector space. Then a general section (def. )

\array{ && T \mathbb{R}^n \\ & {}^{\mathllap{v}}\nearrow& \downarrow\mathrlap{tb} \\ \mathbb{R}^n &=& \mathbb{R}^n }

of the tangent bundle has a unique expansion of the form

v = v^a \partial_a

where a sum over indices is understood (Einstein summation convention) and where the components $(v^a \in C^\infty(\mathbb{R}^n))_{a = 1}^n$ are smooth functions on $\mathbb{R}^n$ (def. ).

Such a $v$ is also called a smooth tangent vector field on $\mathbb{R}^n$ .

Each tangent vector field $v$ on $\mathbb{R}^n$ determines a partial derivative on smooth functions

\array{ C^\infty(\mathbb{R}^n) &\overset{D_v}{\longrightarrow}& C^\infty(\mathbb{R}^n) \\ f &\mapsto& \mathrlap{ D_v f \coloneqq v^a \partial_a (f) \coloneqq \sum_a v^a \frac{\partial f}{\partial x^a} } } \,.

By the product law of differentiation, this is a derivation on the algebra of smooth functions (example ) in that

it is an $\mathbb{R}$ -linear map in that

$D_v( c_1 f_1 + c_2 f_2 ) = c_1 D_v f_1 + c_2 D_v f_2$
it satisfies the Leibniz rule

$D_v(f_1 \cdot f_2) = (D_v f_1) \cdot f_2 + f_1 \cdot (D_v f_2)$

for all $c_1, c_2 \in \mathbb{R}$ and all $f_1, f_2 \in C^\infty(\mathbb{R}^n)$ .

Hence regarding tangent vector fields as partial derivatives constitutes a linear function

D \;\colon\; \Gamma_{\mathbb{R}^n}(T \mathbb{R}^n) \longrightarrow Der(C^\infty(\mathbb{R}^n))

from the space of sections of the tangent bundle. In fact this is a homomorphism of $C^\infty(\mathbb{R}^n)$ -modules (example ), in that for $f \in C^\infty(\mathbb{R}^n)$ and $v \in \Gamma_{\mathbb{R}^n}(T \mathbb{R}^n)$ we have

D_{f v}(-) = f \cdot D_v(-) \,.

Example

(vertical tangent bundle)

Let $E \overset{fb}{\to} \Sigma$ be a fiber bundle. Then its vertical tangent bundle $T_\Sigma E \overset{T fb}{\to} \Sigma$ is the fiber bundle (def. ) over $\Sigma$ whose fiber over a point is the tangent bundle (def. ) of the fiber of $E \overset{fb}{\to}\Sigma$ over that point:

(T_\Sigma E)_x \coloneqq T(E_x) \,.

If $E \simeq \Sigma \times F$ is a trivial fiber bundle with fiber $F$ , then its vertical vector bundle is the trivial fiber bundle with fiber $T F$ .

Definition

(dual vector bundle)

For $E \overset{vb}{\to} \Sigma$ a vector bundle (def. ), its dual vector bundle is the vector bundle whose fiber (2) over $x \in \Sigma$ is the dual vector space of the corresponding fiber of $E \to \Sigma$ :

(E^\ast)_x \;\coloneqq\; (E_x)^\ast \,.

The defining pairing of dual vector spaces $(E_x)^\ast \otimes E_x \to \mathbb{R}$ applied pointwise induces a pairing on the modules of sections (def. ) of the original vector bundle and its dual with values in the smooth functions (def. ):

(3)

\array{ \Gamma_\Sigma(E) \otimes_{C^\infty(X)} \Gamma_\Sigma(E^\ast) &\longrightarrow& C^\infty(\Sigma) \\ (v,\alpha) &\mapsto& (v \cdot \alpha \colon x \mapsto \alpha_x(v_x) ) }

$\,$

synthetic differential geometry

Below we encounter generalizations of ordinary differential geometry that include explicit “infinitesimals” in the guise of infinitesimally thickened points, as well as “super-graded infinitesimals”, in the guise of superpoints (necessary for the description of fermion fields such as the Dirac field). As we discuss below, these structures are naturally incorporated into differential geometry in just the same way as Grothendieck introduced them into algebraic geometry (in the guise of “formal schemes”), namely in terms of formally dual rings of functions with nilpotent ideals. That this also works well for differential geometry rests on the following three basic but important properties, which say that smooth functions behave “more algebraically” than their definition might superficially suggest:

Proposition

(the three magic algebraic properties of differential geometry)

embedding of Cartesian spaces into formal duals of R-algebras

For $X$ and $Y$ two Cartesian spaces, the smooth functions $f \colon X \longrightarrow Y$ between them (def. ) are in natural bijection with their induced algebra homomorphisms $C^\infty(X) \overset{f^\ast}{\longrightarrow} C^\infty(Y)$ (example ), so that one may equivalently handle Cartesian spaces entirely via their $\mathbb{R}$ -algebras of smooth functions.

Stated more abstractly, this means equivalently that the functor $C^\infty(-)$ that sends a smooth manifold $X$ to its $\mathbb{R}$ -algebra $C^\infty(X)$ of smooth functions (example ) is a fully faithful functor:

$C^\infty(-) \;\colon\; SmthMfd \overset{\phantom{AAAA}}{\hookrightarrow} \mathbb{R} Alg^{op} \,.$

(Kolar-Slovak-Michor 93, lemma 35.8, corollaries 35.9, 35.10)
embedding of smooth vector bundles into formal duals of R-algebra modules

For $E_1 \overset{vb_1}{\to} X$ and $E_2 \overset{vb_2}{\to} X$ two vector bundle (def. ) there is then a natural bijection between vector bundle homomorphisms $f \colon E_1 \to E_2$ and the homomorphisms of modules $f_\ast \;\colon\; \Gamma_X(E_1) \to \Gamma_X(E_2)$ that these induces between the spaces of sections (example ).

More abstractly this means that the functor $\Gamma_X(-)$ is a fully faithful functor

$\Gamma_X(-) \;\colon\; VectBund_X \overset{\phantom{AAAA}}{\hookrightarrow} C^\infty(X) Mod$

(Nestruev 03, theorem 11.29)

Moreover, the modules over the $\mathbb{R}$ -algebra $C^\infty(X)$ of smooth functions on $X$ which arise this way as sections of smooth vector bundles over a Cartesian space $X$ are precisely the finitely generated free modules over $C^\infty(X)$ .

(Nestruev 03, theorem 11.32)
vector fields are derivations of smooth functions.

For $X$ a Cartesian space (example ), then any derivation $D \colon C^\infty(X) \to C^\infty(X)$ on the $\mathbb{R}$ -algebra $C^\infty(X)$ of smooth functions (example ) is given by differentiation with respect to a uniquely defined smooth tangent vector field: The function that regards tangent vector fields with derivations from example

$\array{ \Gamma_X(T X) &\overset{\phantom{A}\simeq\phantom{A}}{\longrightarrow}& Der(C^\infty(X)) \\ v &\mapsto& D_v }$

is in fact an isomorphism.

(This follows directly from the Hadamard lemma.)

Actually all three statements in prop. hold not just for Cartesian spaces, but generally for smooth manifolds (def./prop. below; if only we generalize in the second statement from free modules to projective modules. However for our development here it is useful to first focus on just Cartesian spaces and then bootstrap the theory of smooth manifolds and much more from that, which we do below.

$\,$

differential forms

We introduce and discuss differential forms on Cartesian spaces.

Definition

(differential 1-forms on Cartesian spaces and the cotangent bundle)

For $n \in \mathbb{N}$ a smooth differential 1-form $\omega$ on a Cartesian space $\mathbb{R}^n$ (def. ) is an n-tuple

\left(\omega_i \in CartSp\left(\mathbb{R}^n,\mathbb{R}\right)\right)_{i = 1}^n

of smooth functions (def. ), which we think of equivalently as the coefficients of a formal linear combination

\omega = \omega_i d x^i

on a set $\{d x^1, d x^2, \cdots, d x^n\}$ of cardinality $n$ .

Here a sum over repeated indices is tacitly understood (Einstein summation convention).

Write

\Omega^1(\mathbb{R}^k) \simeq CartSp(\mathbb{R}^k, \mathbb{R})^{\times k}\in Set

for the set of smooth differential 1-forms on $\mathbb{R}^k$ .

We may think of the expressions $(d x^a)_{a = 1}^n$ as a linear basis for the dual vector space $\mathbb{R}^n$ . With this the differential 1-forms are equivalently the sections (def. ) of the trivial vector bundle (example , def. )

\array{ T^\ast \mathbb{R}^n &\coloneqq& \mathbb{R}^n \times (\mathbb{R}^n)^\ast \\ \mathllap{cb}\downarrow && \downarrow\mathrlap{pr_1} \\ \mathbb{R}^n &=& \mathbb{R}^n }

called the cotangent bundle of $\mathbb{R}^n$ (def. ):

\Omega^1(\mathbb{R}^n) = \Gamma_{\mathbb{R}^n}(T^\ast \mathbb{R}^n) \,.

This amplifies via example that $\Omega^1(\mathbb{R}^n)$ has the structure of a module over the algebra of smooth functions $C^\infty(\mathbb{R}^n)$ , by the evident multiplication of differential 1-forms with smooth functions:

The set $\Omega^1(\mathbb{R}^k)$ of differential 1-forms in a Cartesian space (def. ) is naturally an abelian group with addition given by componentwise addition

$\begin{aligned} \omega + \lambda & = \omega_i d x^i + \lambda_i d x^i \\ & = (\omega_i + \lambda_i) d x^i \end{aligned} \,,$
The abelian group $\Omega^1(\mathbb{R}^k)$ is naturally equipped with the structure of a module over the algebra of smooth functions $C^\infty(\mathbb{R}^k)$ (example ), where the action $C^\infty(\mathbb{R}^k) \times\Omega^1(\mathbb{R}^k) \to \Omega^1(\mathbb{R}^k)$ is given by componentwise multiplication

$f \cdot \omega = ( f \cdot \omega_i) d x^i \,.$

Accordingly there is a canonical pairing between differential 1-forms and tangent vector fields (example )

(4)

\array{ \Gamma_{\mathbb{R}^n}(T \mathbb{R}^n) \otimes_{\mathbb{R}} \Gamma_{\mathbb{R}^n}(T \ast \mathbb{R}^n) &\overset{\iota_{(-)}(-) }{\longrightarrow}& C^\infty(\mathbb{R}^n) \\ (v,\omega) &\mapsto& \mathrlap{ \iota_v \omega \coloneqq v^a \omega_a } }

With differential 1-forms in hand, we may collect all the first-order partial derivatives of a smooth function into a single object: the exterior derivative or de Rham differential is the $\mathbb{R}$ -linear function

(5)

\array{ C^\infty(\mathbb{R}^n) &\overset{d}{\longrightarrow}& \Omega^1(\mathbb{R}^n) \\ f &\mapsto& \mathrlap{ d f \coloneqq \frac{\partial f}{ \partial x^a} d x^a } } \,.

Under the above pairing with tangent vector fields $v$ this yields the particular partial derivative along $v$ :

\iota_v d f = D_v f = v^a \frac{\partial f}{\partial x^a} \,.

We think of $d x^i$ as a measure for infinitesimal displacements along the $x^i$ -coordinate of a Cartesian space. If we have a measure of infintesimal displacement on some $\mathbb{R}^n$ and a smooth function $f \colon \mathbb{R}^{\tilde n} \to \mathbb{R}^n$ , then this induces a measure for infinitesimal displacement on $\mathbb{R}^{\tilde n}$ by sending whatever happens there first with $f$ to $\mathbb{R}^n$ and then applying the given measure there. This is captured by the following definition:

Definition

(pullback of differential 1-forms)

For $\phi \colon \mathbb{R}^{\tilde k} \to \mathbb{R}^k$ a smooth function, the pullback of differential 1-forms along $\phi$ is the function

\phi^* \colon \Omega^1(\mathbb{R}^{k}) \to \Omega^1(\mathbb{R}^{\tilde k})

between sets of differential 1-forms, def. , which is defined on basis-elements by

\phi^* d x^i \;\coloneqq\; \frac{\partial \phi^i}{\partial \tilde x^j} d \tilde x^j

and then extended linearly by

\begin{aligned} \phi^* \omega & = \phi^* \left( \omega_i d x^i \right) \\ & \coloneqq \left(\phi^* \omega\right)_i \frac{\partial \phi^i }{\partial \tilde x^j} d \tilde x^j \\ & = (\omega_i \circ \phi) \cdot \frac{\partial \phi^i }{\partial \tilde x^j} d \tilde x^j \end{aligned} \,.

This is compatible with identity morphisms and composition in that

(6)

(id_{\mathbb{R}^n})^\ast = id_{\Omega^1(\mathbb{R}^n)} \phantom{AAAA} (g \circ f)^\ast = f^\ast \circ g^\ast \,.

Stated more abstractly, this just means that pullback of differential 1-forms makes the assignment of sets of differential 1-forms to Cartesian spaces a contravariant functor

\Omega^1(-) \;\colon\; CartSp^{op} \longrightarrow Set \,.

The following definition captures the idea that if $d x^i$ is a measure for displacement along the $x^i$ -coordinate, and $d x^j$ a measure for displacement along the $x^j$ coordinate, then there should be a way to get a measure, to be called $d x^i \wedge d x^j$ , for infinitesimal surfaces (squares) in the $x^i$ - $x^j$ -plane. And this should keep track of the orientation of these squares, with

d x^j \wedge d x^i = - d x^i \wedge d x^j

being the same infinitesimal measure with orientation reversed.

Definition

(exterior algebra of differential n-forms)

For $k,n \in \mathbb{N}$ , the smooth differential forms on a Cartesian space $\mathbb{R}^k$ (def. ) is the exterior algebra

\Omega^\bullet(\mathbb{R}^k) \coloneqq \wedge^\bullet_{C^\infty(\mathbb{R}^k)} \Omega^1(\mathbb{R}^k)

over the algebra of smooth functions $C^\infty(\mathbb{R}^k)$ (example ) of the module $\Omega^1(\mathbb{R}^k)$ of smooth 1-forms.

We write $\Omega^n(\mathbb{R}^k)$ for the sub-module of degree $n$ and call its elements the differential n-forms.

Explicitly this means that a differential n-form $\omega \in \Omega^n(\mathbb{R}^k)$ on $\mathbb{R}^k$ is a formal linear combination over $C^\infty(\mathbb{R}^k)$ (example ) of basis elements of the form $d x^{i_1} \wedge \cdots \wedge d x^{i_n}$ for $i_1 \lt i_2 \lt \cdots \lt i_n$ :

\omega = \omega_{i_1, \cdots, i_n} d x^{i_1} \wedge \cdots \wedge d x^{i_n} \,.

Now all the constructions for differential 1-forms above extent naturally to differential n-forms:

Definition

(exterior derivative or de Rham differential)

For $\mathbb{R}^n$ a Cartesian space (def. ) the de Rham differential $d \colon C^\infty(\mathbb{R}^n) \to \Omega^1(\mathbb{R}^n)$ (5) uniquely extended as a derivation of degree +1 to the exterior algebra of differential forms (def. )

d \;\colon\; \Omega^\bullet(\mathbb{R}^n) \longrightarrow \Omega^\bullet(\mathbb{R}^n)

meaning that for $\omega_i \in \Omega^{k_i}(\mathbb{R})$ then

d(\omega_1 \wedge \omega_2) \;=\; (d \omega_1) \wedge \omega_2 + \omega_1 \wedge d \omega_2 \,.

In components this simply means that

\begin{aligned} d \omega & = d \left(\omega_{i_1 \cdots i_k} d x^{i_1} \wedge \cdots \wedge d x^{i_k}\right) \\ & = \frac{\partial \omega_{i_1 \cdots i_k}}{\partial x^{a}} d x^a \wedge d x^{i_1} \wedge \cdots \wedge d x^{i_k} \end{aligned} \,.

Since partial derivatives commute with each other, while differential 1-form anti-commute, this implies that $d$ is nilpotent

d^2 = d \circ d = 0 \,.

We say hence that differential forms form a cochain complex, the de Rham complex $(\Omega^\bullet(\mathbb{R}^n), d)$ .

Definition

(contraction of differential n-forms with tangent vector fields)

The pairing $\iota_v \omega = \omega(v)$ of tangent vector fields $v$ with differential 1-forms $\omega$ (4) uniquely extends to the exterior algebra $\Omega^\bullet(\mathbb{R}^n)$ of differential forms (def. ) as a derivation of degree -1

\iota_v \;\colon\; \Omega^{\bullet+1}(\mathbb{R}^n) \longrightarrow \Omega^\bullet(\mathbb{R}^n) \,.

In particular for $\omega_1, \omega_2 \in \Omega^1(\mathbb{R}^n)$ two differential 1-forms, then

\iota_{v} (\omega_1 \wedge \omega_2) \;=\; \omega_1(v) \omega_2 - \omega_2(v) \omega_1 \;\in\; \Omega^1(\mathbb{R}^n) \,.

Proposition

(pullback of differential n-forms)

For $f \colon \mathbb{R}^{n_1} \to \mathbb{R}^{n_2}$ a smooth function between Cartesian spaces (def. ) the operationf of pullback of differential 1-forms of def. extends as an $C^\infty(\mathbb{R}^k)$ -algebra homomorphism to the exterior algebra of differential forms (def. ),

f^\ast \;\colon\; \Omega^\bullet(\mathbb{R}^{n_2}) \longrightarrow \Omega^\bullet(\mathbb{R}^{n_1})

given on basis elements by

f^* \left( dx^{i_1} \wedge \cdots \wedge dx^{i_n} \right) = \left(f^* dx^{i_1} \wedge \cdots \wedge f^* dx^{i_n} \right) \,.

This commutes with the de Rham differential $d$ on both sides (def. ) in that

d \circ f^\ast = f^\ast \circ d \phantom{AAAAA} \array{ \Omega^\bullet(X) &\overset{f^\ast}{\longleftarrow}& \Omega^\bullet(Y) \\ \mathllap{d}\downarrow && \downarrow\mathrlap{d} \\ \Omega^\bullet(X) &\underset{f^\ast}{\longleftarrow}& \Omega^\bullet(Y) }

hence that pullback of differential forms is a chain map of de Rham complexes.

This is still compatible with identity morphisms and composition in that

(7)

(id_{\mathbb{R}^n})^\ast = id_{\Omega^1(\mathbb{R}^n)} \phantom{AAAA} (g \circ f)^\ast = f^\ast \circ g^\ast \,.

Stated more abstractly, this just means that pullback of differential n-forms makes the assignment of sets of differential n-forms to Cartesian spaces a contravariant functor

\Omega^n(-) \;\colon\; CartSp^{op} \longrightarrow Set \,.

Proposition

(Cartan's homotopy formula)

Let $X$ be a Cartesian space (def. ), and let $v \in \Gamma(T X)$ be a smooth tangent vector field (example ).

For $t \in \mathbb{R}$ write $\exp(t v) \colon X \overset{\simeq}{\to} X$ for the flow by diffeomorphisms along $v$ of parameter length $t$ .

Then the derivative with respect to $t$ of the pullback of differential forms along $\exp(t v)$ , hence the Lie derivative $\mathcal{L}_v \colon \Omega^\bullet(X) \to \Omega^\bullet(X)$ , is given by the anticommutator of the contraction derivation $\iota_v$ (def. ) with the de Rham differential $d$ (def. ):

\begin{aligned} \mathcal{L}_v &\coloneqq \frac{d}{d t } \exp(t v)^\ast \omega \vert_{t = 0} \\ & = \iota_v d \omega + d \iota_v \omega \,. \end{aligned}

Finally we turn to the concept of integration of differential forms (def. below). First we need to say what it is that differential forms may be integrated over:

Definition

(smooth singular simplicial chains in Cartesian spaces)

For $n \in \mathbb{N}$ , the standard n-simplex in the Cartesian space $\mathbb{R}^n$ (def. ) is the subset

\Delta^n \;\coloneqq\; \left\{ (x^i)_{i = 1}^n \;\vert\; 0 \leq x^1 \leq \cdots \leq x^n \right\} \;\subset\; \mathbb{R}^n \,.

More generally, a smooth singular n-simplex in a Cartesian space $\mathbb{R}^k$ is a smooth function (def. )

\sigma \;\colon\; \mathbb{R}^n \longrightarrow \mathbb{R}^k \,,

to be thought of as a smooth extension of its restriction

\sigma\vert_{\Delta^n} \;\colon\; \Delta^n \longrightarrow \mathbb{R}^k \,.

(This is called a singular simplex because there is no condition that $\Sigma$ be an embedding in any way, in particular $\sigma$ may be a constant function.)

A singular chain in $\mathbb{R}^k$ of dimension $n$ is a formal linear combination of singular $n$ -simplices in $\mathbb{R}^k$ .

In particular, given a singular $n+1$ -simplex $\sigma$ , then its boundary is a singular chain of singular $n$ -simplices $\partial \sigma$ .

Definition

(fiber-integration of differential forms) over smooth singular chains in Cartesian spaces)

For $n \in \mathbb{N}$ and $\omega \in \Omega^n(\mathbb{R}^n)$ a differential n-form (def. ), which may be written as

\omega = f d x^1 \wedge \cdots d x^n \,,

then its integration over the standard n-simplex $\Delta^n \subset \mathbb{R}^n$ (def. ) is the ordinary integral (e.g. Riemann integral)

\int_{\Delta^n} \omega \;\coloneqq\; \underset{0 \leq x^1 \leq \cdots \leq x^n \leq 1}{\int} f(x^1, \cdots, x^n) \, d x^1 \cdots d x^n \,.

More generally, for

$\omega \in \Omega^n(\mathbb{R}^k)$ a differential n-forms;
$C = \underset{i}{\sum} c_i \sigma_i$ a singular $n$ -chain (def. )

in any Cartesian space $\mathbb{R}^k$ . Then the integration of $\omega$ over $x$ is the sum of the integrations, as above, of the pullback of differential forms (def. ) along all the singular n-simplices in the chain:

\int_C \omega \;\coloneqq\; \underset{i}{\sum} c_i \int_{\Delta^n} (\sigma_i)^\ast \omega \,.

Finally, for $U$ another Cartesian space, then fiber integration of differential forms along $U \times C \to U$ is the linear map

\int_C \;\colon\; \Omega^{\bullet + dim(C)}(U \times C) \longrightarrow \Omega^\bullet(U)

which on differential forms of the form $\omega_U \wedge \omega$ with ( $\omega_U$ pulled back from $U$ and $\omega$ from $C$ ) is given by:

\int_C \omega_U \wedge \omega \;\coloneqq\; (-1)^{\vert \omega_U\vert} \Big( \textstyle{\int}_C \omega \Big) \omega_U \,.

Proposition

(Stokes theorem for fiber-integration of differential forms)

For $\Sigma$ a smooth singular simplicial chain (def. ) the operation of fiber-integration of differential forms along $U \times \Sigma \overset{pr_1}{\longrightarrow} U$ (def. ) is compatible with the exterior derivative $d_U$ on $U$ (def. ) in that

\begin{aligned} d \int_\Sigma \omega & = (-1)^{dim(\Sigma)} \int_\Sigma d_U \omega \\ & = (-1)^{dim(\Sigma)} \left( \int_\Sigma d \omega - \int_{\partial \Sigma} \omega \right) \end{aligned} \,,

where $d = d_U + d_\Sigma$ is the de Rham differential on $U \times \Sigma$ (def. ) and where the second equality is the Stokes theorem along $\Sigma$ :

\int_\Sigma d_\Sigma \omega = \int_{\partial \Sigma} \omega \,.

$\,$

This concludes our review of the basics of (synthetic) differential geometry on which the following development of quantum field theory is based. In the next chapter we consider spacetime and spin.

Spacetime

Relativistic field theory takes place on spacetime.

The concept of spacetime makes sense for every dimension $p+1$ with $p \in \mathbb{N}$ . The observable universe has macroscopic dimension $3+1$ , but quantum field theory generally makes sense also in lower and in higher dimensions. For instance quantum field theory in dimension 0+1 is the “worldline” theory of particles, also known as quantum mechanics; while quantum field theory in dimension $\gt p+1$ may be “KK-compactified” to an “effective” field theory in dimension $p+1$ which generally looks more complicated than its higher dimensional incarnation.

However, every realistic field theory, and also most of the non-realistic field theories of interest, contain spinor fields such as the Dirac field (example below) and the precise nature and behaviour of spinors does depend sensitively on spacetime dimension. In fact the theory of relativistic spinors is mathematically most natural in just the following four spacetime dimensions:

p +1 = \phantom{AAAAA} \array{ 2+1,\; & 3+1,\; & \, & 5+1,\; &\, & \, & \, & \, 9+1 }

In the literature one finds these four dimensions advertized for two superficially unrelated reasons:

in precisely these dimensions “twistors” exist (see there);
in precisely these dimensions “GS-superstrings” exist (see there).

However, both these explanations have a common origin in something simpler and deeper: Spacetime in these dimensions appears from the “Pauli matrices” with entries in the real normed division algebras. (In fact it goes deeper still, but this will not concern us here.)

This we explain now, and then we use this to obtain a slick handle on spinors in these dimensions, using simple linear algebra over the four real normed division algebras. At the end (in remark ) we give a dictionary that expresses these constructions in terms of the “two-component spinor notation” that is traditionally used in physics texts (remark below).

The relation between real spin representations and division algebras, is originally due to Kugo-Townsend 82, Sudbery 84 and others. We follow the streamlined discussion in Baez-Huerta 09 and Baez-Huerta 10.

A key extra structure that the spinors impose on the underlying Cartesian space of spacetime is its causal structure, which determines which points in spacetime (“events”) are in the future or the past of other points (def. below). This causal structure will turn out to tightly control the quantum field theory on spacetime in terms of the “causal additivity of the S-matrix” (prop. below) and the induced “causal locality” of the algebra of quantum observables (prop. below). To prepare the discussion of these constructions, we end this chapter with some basics on the causal structure of Minkowski spacetime.

$\,$

Real division algebras
Spacetime in dimensions 3, 4, 6 and 10
Lorentz group and Spin group
Spinors in dimensions 3, 4, 6 and 10
Causal structure

$\,$

Real division algebras

To amplify the following pattern and to fix our notation for algebra generators, recall these definitions:

Definition

(complex numbers)

The complex numbers $\mathbb{C}$ is the commutative algebra over the real numbers $\mathbb{R}$ which is generated from one generators $\{e_1\}$ subject to the relation

$(e_1)^2 = -1$ .

Definition

(quaternions)

The quaternions $\mathbb{H}$ is the associative algebra over the real numbers which is generated from three generators $\{e_1, e_2, e_3\}$ subject to the relations

for all $i$

$(e_i)^2 = -1$
for $(i,j,k)$ a cyclic permutation of $(1,2,3)$ then
1. $e_i e_j = e_k$
2. $e_j e_i = -e_k$

(graphics grabbed from Baez 02)

Definition

(octonions)

The octonions $\mathbb{O}$ is the nonassociative algebra over the real numbers which is generated from seven generators $\{e_1, \cdots, e_7\}$ subject to the relations

for all $i$

$(e_i)^2 = -1$
for $e_i \to e_j \to e_k$ an edge or circle in the diagram shown (a labeled version of the Fano plane) then
1. $e_i e_j = e_k$
2. $e_j e_i = -e_k$
and all relations obtained by cyclic permutation of the indices in these equations.

(graphics grabbed from Baez 02)

One defines the following operations on these real algebras:

Definition

(conjugation, real part, imaginary part and absolute value)

For $\mathbb{K} \in \{\mathbb{R}, \mathbb{C}, \mathbb{H}, \mathbb{O}\}$ , let

(-)^\ast \;\colon\; \mathbb{K} \longrightarrow \mathbb{K}

be the antihomomorphism of real algebras

\begin{aligned} (r a)^\ast = r a^\ast &, \text{for}\;\; r \in \mathbb{R}, a \in \mathbb{K} \\ (a b)^\ast = b^\ast a^\ast &,\text{for}\;\; a,b \in \mathbb{K} \end{aligned}

given on the generators of def. , def. and def. by

(e_i)^\ast = - e_i \,.

This operation makes $\mathbb{K}$ into a star algebra. For the complex numbers $\mathbb{C}$ this is called complex conjugation, and in general we call it conjugation.

Let then

Re \;\colon\; \mathbb{K} \longrightarrow \mathbb{R}

be the function

Re(a) \;\coloneqq\; \tfrac{1}{2}(a + a^\ast)

(“real part”) and

Im \;\colon\; \mathbb{K} \longrightarrow \mathbb{R}

be the function

Im(a) \;\coloneqq \; \tfrac{1}{2}(a - a^\ast)

(“imaginary part”).

It follows that for all $a \in \mathbb{K}$ then the product of a with its conjugate is in the real center of $\mathbb{K}$

a a^\ast = a^\ast a \;\in \mathbb{R} \hookrightarrow \mathbb{K}

and we write the square root of this expression as

{\vert a\vert} \;\coloneqq\; \sqrt{a a^\ast}

called the norm or absolute value function

{\vert -\vert} \;\colon\; \mathbb{K} \longrightarrow \mathbb{R} \,.

This norm operation clearly satisfies the following properties (for all $a,b \in \mathbb{K}$ )

$\vert a \vert \geq 0$ ;
${\vert a \vert } = 0 \;\;\;\;\; \Leftrightarrow\;\;\;\;\;\; a = 0$ ;
${\vert a b \vert } = {\vert a \vert} {\vert b \vert}$

and hence makes $\mathbb{K}$ a normed algebra.

Since $\mathbb{R}$ is a division algebra, these relations immediately imply that each $\mathbb{K}$ is a division algebra, in that

a b = 0 \;\;\;\;\;\; \Rightarrow \;\;\;\;\;\; a = 0 \;\; \text{or} \;\; b = 0 \,.

Hence the conjugation operation makes $\mathbb{K}$ a real normed division algebra.

Remark

(sequence of inclusions of real normed division algebras)

Suitably embedding the sets of generators in def. , def. and def. into each other yields sequences of real star-algebra inclusions

\mathbb{R} \hookrightarrow \mathbb{C} \hookrightarrow \mathbb{H} \hookrightarrow \mathbb{O} \,.

For example for the first two inclusions we may send each generator to the generator of the same name, and for the last inclusion me may choose

\array{ 1 &\mapsto& 1 \\ e_1 &\mapsto & e_3 \\ e_2 &\mapsto& e_4 \\ e_3 &\mapsto& e_6 }

Proposition

(Hurwitz theorem: $\mathbb{R}$ , $\mathbb{C}$ , $\mathbb{H}$ and $\mathbb{O}$ are the normed real division algebras)

The four algebras of real numbers $\mathbb{R}$ , complex numbers $\mathbb{C}$ , quaternions $\mathbb{H}$ and octonions $\mathbb{O}$ from def. , def. and def. respectively, which are real normed division algebras via def. , are, up to isomorphism, the only real normed division algebras that exist.

Remark

(Cayley-Dickson construction and sedenions)

While prop. says that the sequence from remark

\mathbb{R} \hookrightarrow \mathbb{C} \hookrightarrow \mathbb{H} \hookrightarrow \mathbb{O}

is maximal in the category of real normed non-associative division algebras, there is a pattern that does continue if one disregards the division algebra property. Namely each step in this sequence is given by a construction called forming the Cayley-Dickson double algebra. This continues to an unbounded sequence of real nonassociative star-algebras

\mathbb{R} \hookrightarrow \mathbb{C} \hookrightarrow \mathbb{H} \hookrightarrow \mathbb{O} \hookrightarrow \mathbb{S} \hookrightarrow \cdots

where the next algebra $\mathbb{S}$ is called the sedenions.

What actually matters for the following relation of the real normed division algebras to real spin representations is that they are also alternative algebras:

Definition

(alternative algebras)

Given any non-associative algebra $A$ , then the trilinear map

[-,-,-] \;-\; A \otimes A \otimes A \longrightarrow A

given on any elements $a,b,c \in A$ by

[a,b,c] \coloneqq (a b) c - a (b c)

is called the associator (in analogy with the commutator $[a,b] \coloneqq a b - b a$ ).

If the associator is completely antisymmetric (in that for any permutation $\sigma$ of three elements then $[a_{\sigma_1}, a_{\sigma_2}, a_{\sigma_3}] = (-1)^{\vert \sigma\vert} [a_1, a_2, a_3]$ for $\vert \sigma \vert$ the signature of the permutation) then $A$ is called an alternative algebra.

If the characteristic of the ground field is different from 2, then alternativity is readily seen to be equivalent to the conditions that for all $a,b \in A$ then

(a a)b = a (a b) \;\;\;\;\; \text{and} \;\;\;\;\; (a b) b = a (b b) \,.

We record some basic properties of associators in alternative star-algebras that we need below:

Proposition

(properties of alternative star algebras)

Let $A$ be an alternative algebra (def. ) which is also a star algebra. Then (using def. ):

the associator vanishes when at least one argument is real

$[Re(a),b,c]$
the associator changes sign when one of its arguments is conjugated

$[a,b,c] = -[a^\ast,b,c] \,;$
the associator vanishes when one of its arguments is the conjugate of another

$[a,a^\ast, b] = 0 \,;$
the associator is purely imaginary

$Re([a,b,c]) = 0 \,.$

Proof

That the associator vanishes as soon as one argument is real is just the linearity of an algebra product over the ground ring.

Hence in fact

[a,b,c] = [Im(a), Im(b), Im(c)] \,.

This implies the second statement by linearity. And so follows the third statement by skew-symmetry:

[a,a^\ast,b] = -[a,a,b] = 0 \,.

The fourth statement finally follows by this computation:

\begin{aligned} \,[ a, b, c]^\ast & = -[c^\ast, b^\ast, a^\ast] \\ & = -[c,b,a] \\ & = -[a,b,c] \end{aligned} \,.

Here the first equation follows by inspection and using that $(a b)^\ast = b^\ast a^\ast$ , the second follows from the first statement above, and the third is the anti-symmetry of the associator.

It is immediate to check that:

Proposition

( $\mathbb{R}$ , $\mathbb{C}$ , $\mathbb{H}$ and $\mathbb{O}$ are real alternative algebras)

The real algebras of real numbers, complex numbers, def. ,quaternions def. and octonions def. are alternative algebras (def. ).

Proof

Since the real numbers, complex numbers and quaternions are associative algebras, their associator vanishes identically. It only remains to see that the associator of the octonions is skew-symmetric. By linearity it is sufficient to check this on generators. So let $e_i \to e_j \to e_k$ be a circle or a cyclic permutation of an edge in the Fano plane. Then by definition of the octonion multiplication we have

\begin{aligned} (e_i e_j) e_j &= e_k e_j \\ &= - e_j e_k \\ & = -e_i \\ & = e_i (e_j e_j) \end{aligned}

and similarly

\begin{aligned} (e_i e_i ) e_j &= - e_j \\ &= - e_k e_i \\ &= e_i e_k \\ &= e_i (e_i e_j) \end{aligned} \,.

The analog of the Hurwitz theorem (prop. ) is now this:

Proposition

( $\mathbb{R}$ , $\mathbb{C}$ , $\mathbb{H}$ and $\mathbb{O}$ are precisely the alternative real division algebras)

The only division algebras over the real numbers which are also alternative algebras (def. ) are the real numbers themselves, the complex numbers, the quaternions and the octonions from prop. .

This is due to (Zorn 30).

For the following, the key point of alternative algebras is this equivalent characterization:

Proposition

(alternative algebra detected on subalgebras spanned by any two elements)

A nonassociative algebra is alternative, def. , precisely if the subalgebra generated by any two elements is an associative algebra.

This is due to Emil Artin, see for instance (Schafer 95, p. 18).

Proposition is what allows to carry over a minimum of linear algebra also to the octonions such as to yield a representation of the Clifford algebra on $\mathbb{R}^{9,1}$ . This happens in the proof of prop. below.

So we will be looking at a fragment of linear algebra over these four normed division algebras. To that end, fix the following notation and terminology:

Definition

(hermitian matrices with values in real normed division algebras)

Let $\mathbb{K}$ be one of the four real normed division algebras from prop. , hence equivalently one of the four real alternative division algebras from prop. .

Say that an $n \times n$ matrix with coefficients in $\mathbb{K}$

A\in Mat_{n\times n}(\mathbb{K})

is a hermitian matrix if the transpose matrix $(A^t)_{i j} \coloneqq A_{j i}$ equals the componentwise conjugated matrix (def. ):

A^t = A^\ast \,.

Hence with the notation

(-)^\dagger \coloneqq ((-)^t)^\ast

we have that $A$ is a hermitian matrix precisely if

A = A^\dagger \,.

We write $Mat_{2 \times 2}^{her}(\mathbb{K})$ for the real vector space of hermitian matrices.

Definition

(trace reversal)

Let $A \in Mat_{2 \times 2}^{her}(\mathbb{K})$ be a hermitian $2 \times 2$ matrix as in def. . Its trace reversal is the result of subtracting its trace times the identity matrix:

\tilde A \;\coloneqq\; A - (tr A) 1_{n\times n} \,.

$\,$

Minkowski spacetime in dimensions 3,4,6 and 10

We now discover Minkowski spacetime of dimension 3,4,6 and 10, in terms of the real normed division algebras $\mathbb{K}$ from prop. , equivalently the real alternative division algebras from prop. : this is prop./def. and def. below.

Proposition/Definition

(Minkowski spacetime as real vector space of hermitian matrices in real normed division algebras)

Let $\mathbb{K}$ be one of the four real normed division algebras from prop. , hence one of the four real alternative division algebras from prop. .

Then the real vector space of $2 \times 2$ hermitian matrices over $\mathbb{K}$ (def. ) equipped with the inner product $\eta$ whose quadratic form ${\vert -\vert^2_\eta}$ is the negative of the determinant operation on matrices is Minkowski spacetime:

(8)

\begin{aligned} \mathbb{R}^{dim_{\mathbb{R}}(\mathbb{K})+1,1} & \coloneqq \left( \mathbb{R}^{dim_{\mathbb{R}(\mathbb{K})}+2} , {\vert -\vert^2_\eta} \right) & \coloneqq \left(Mat_{2 \times 2}^{her}(\mathbb{K}), -det \right) \end{aligned} \,.

hence

$\mathbb{R}^{2,1}$ for $\mathbb{K} = \mathbb{R}$ ;
$\mathbb{R}^{3,1}$ for $\mathbb{K} = \mathbb{C}$ ;
$\mathbb{R}^{5,1}$ for $\mathbb{K} = \mathbb{H}$ ;
$\mathbb{R}^{9,1}$ for $\mathbb{K} = \mathbb{O}$ .

Here we think of the vector space on the left as $\mathbb{R}^{p,1}$ with

p \coloneqq dim_{\mathbb{R}}(\mathbb{K})+1

equipped with the canonical coordinates labeled $(x^\mu)_{\mu = 0}^p$ .

As a linear map the identification is given by

(x^0, x^1, \cdots, x^{d-1}) \;\mapsto\; \left( \array{ x^0 + x^1 & y \\ y^\ast & x^0 - x^1 } \right) \;\;\; \text{with}\; y \coloneqq x^2 1 + x^3 e_1 + x^4 e_2 + \cdots + x^{2 + dim_{\mathbb{R}(\mathbb{K})}} \,e_{dim_{\mathbb{R}}(\mathbb{K})-1} \,.

This means that the quadratic form ${\vert - \vert^2_\eta}$ is given on an element $v = (v^\mu)_{\mu = 0}^p$ by

{\vert v \vert}^2_{\eta} \;=\; - (v^0)^2 + \underoverset{j = 1}{p}{\sum} (x^j)^2 \,.

By the polarization identity the quadratic form ${\vert - \vert^2_\eta}$ induces a bilinear form

\eta \;\colon\; \mathbb{R}^{p,1}\otimes \mathbb{R}^{p,1} \longrightarrow \mathbb{R}

given by

\begin{aligned} \eta(v_1, v_2) & = \eta_{\mu \nu} v_1^\mu v_1^\nu \\ & \coloneqq - v_1^0 v_2^0 + \underoverset{j = 1}{p}{\sum} v_1^j v_2^j \end{aligned} \,.

This is called the Minkowski metric.

Finally, under the above identification the operation of trace reversal from def. corresponds to time reversal in that

\widetilde{ \left( \array{ x^0 + x^1 & y \\ y^\ast & x^0 - x^1 } \right) } \;=\; \left( \array{ -x^0 + x^1 & y \\ y^\ast & -x^0 - x^1 } \right) \,.

Proof

We need to check that under the given identification, the Minkowski norm-square is indeed given by minus the determinant on the corresponding hermitian matrices. This follows from the nature of the conjugation operation $(-)^\ast$ from def. :

\begin{aligned} - det \left( \array{ x^0 + x^1 & y \\ y^\ast & x^0 - x^1 } \right) & = -(x^0 + x^1)(x^0 - x^1) + y y^\ast \\ & = -(x^0)^2 + \underoverset{i = 1}{p}{\sum} (x^i)^2 \end{aligned} \,.

Remark

(physical units of length)

As the term “metric” suggests, in application to physics, the Minkowski metric $\eta$ in prop./def. is regarded as a measure of length: for $v \in \Gamma_x(T \mathbb{R}^{p,1})$ a tangent vector at a point $x$ in Minkowski spacetime, interpreted as a displacement from event $x$ to event $x + v$ , then

if $\eta(v,v) \gt 0$ then

$\sqrt{\eta(v,v)} \in \mathbb{R}$

is interpreted as a measure for the spatial distance between $x$ and $x + v$ ;
if $\eta(v,v) \lt 0$ then

$\sqrt{-\eta(v,v)} \in \mathbb{R}$

is interpreted as a measure for the time distance between $x$ and $x + v$ .

But for this to make physical sense, an operational prescription needs to be specified that tells the experimentor how the real number $\sqrt{\eta(v,v)}$ is to be translated into an physical distance between actual events in the observable universe.

Such an operational prescription is called a physical unit of length. For example “centimeter” $cm$ is a physical unit of length, another one is “femtometer” $fm$ .

The combined information of a real number $\sqrt{\eta(v,v)} \in \mathbb{R}$ and a physical unit of length such as meter, jointly written

\sqrt{\eta(v,v)} \, cm

is a prescription for finding actual distance in the observable universe. Alternatively

\sqrt{\eta(v,v)} \, fm

is another prescription, that translates the same real number $\sqrt{\eta(v,v)}$ into another physical distance.

But of course they are related, since physical units form a torsor over the group $\mathbb{R}_{\gt 0}$ of non-negative real numbers, meaning that any two are related by a unique rescaling. For example

fm = 10^{-13} cm \,,

with $10^{-13} \in \mathbb{R}_{\gt 0}$ .

This means that once any one prescription of turning real numbers into spacetime distances is specified, then any other such prescription is obtained from this by rescaling these real numbers. For example

\begin{aligned} \sqrt{\eta(v,v)} \, fm & = \left( 10^{-13} \sqrt{\eta(v,v)}\right) \,cm \\ & = \sqrt{ 10^{-26} \eta(v,v) } \, cm \end{aligned} \,.

The point to notice here is that, via the last line, we may think of this as rescaling the metric from $\eta$ to $10^{-30} \eta$ .

In quantum field theory physical units of length are typically expressed in terms of a physical unit of “action”, called “Planck's constant” $\hbar$ , via the combination of units called the Compton wavelength

(9)

\ell_m = \frac{2\pi \hbar}{m c} \,.

parameterized, in turn, by a physical unit of mass $m$ . For the mass of the electron, the Compton wavelength is

\ell_e = \frac{2\pi \hbar}{m_e c} \sim 386 \, fm \,.

Another physical unit of length parameterized by a mass $m$ is the Schwarzschild radius $r_m \coloneqq 2 m G/c^2$ , where $G$ is the gravitational constant. Solving the equation

\array{ & \ell_m &=& r_m \\ \Leftrightarrow & 2\pi\hbar / m c &=& 2 m G / c^2 }

for $m$ yields the Planck mass

m_{P} \coloneqq \tfrac{1}{\sqrt{\pi}} m_{\ell = r} = \sqrt{\frac{\hbar c}{G}} \,.

The corresponding Compton wavelength $\ell_{m_{P}}$ is given by the Planck length $\ell_P$

\ell_{P} \coloneqq \tfrac{1}{2\pi} \ell_{m_P} = \sqrt{ \frac{\hbar G}{c^3} } \,.

Definition

(Minkowski spacetime as a pseudo-Riemannian Cartesian space)

Prop./def. introduces Minkowski spacetime $\mathbb{R}^{p,1}$ for $p+1 \in \{3,4,6,10\}$ as a a vector space $\mathbb{R}^{p,1}$ equipped with a norm ${\vert - \vert_\eta}$ . The genuine spacetime corresponding to this is this vector space regaded as a Cartesian space, i.e. with smooth functions (instead of just linear maps) to it and from it (def. ). This still carries one copy of $\mathbb{R}^{p,1}$ over each point $x \in \mathbb{R}^{p,1}$ , as its tangent space (example )

T_x \mathbb{R}^{p,1} \simeq \mathbb{R}^{p,1}

and the Cartesian space $\mathbb{R}^{p,1}$ equipped with the Lorentzian inner product from prop./def. on each tangent space $T_x \mathbb{R}^{p,1}$ (a “pseudo-Riemannian Cartesian space”) is Minkowski spacetime as such.

We write

(10)

dvol_\Sigma \;\coloneqq\; d x^0 \wedge d x^1 \wedge \cdots \wedge d x^p \in \Omega^{p+1}(\mathbb{R}^{p,1})

for the canonical volume form on Minkowski spacetime.

We use the Einstein summation convention: Expressions with repeated indices indicate summation over the range of indices.

For example a differential 1-form $\alpha \in \Omega^1(\mathbb{R}^{p,1})$ on Minkowski spacetime may be expanded as

\alpha = \alpha_\mu d x^\mu \,.

Moreover we use square brackets around indices to indicate skew-symmetrization. For example a differential 2-form $\beta \in \Omega^2(\mathbb{R}^{p,1})$ on Minkowski spacetime may be expanded as

\begin{aligned} \beta & = \beta_{\mu \nu} d x^\mu \wedge d x^\nu \\ & = \beta_{[\mu \nu]} d x^\mu \wedge d x^\nu \end{aligned}

$\,$

The identification of Minkowski spacetime (def. ) in the exceptional dimensions with the generalized Pauli matrices (prop./def. ) has some immediate useful implications:

Proposition

(Minkowski metric in terms of trace reversal)

In terms of the trace reversal operation $\widetilde{(-)}$ from def. , the determinant operation on hermitian matrices (def. ) has the following alternative expression

\begin{aligned} -det(A) & = A \tilde A \\ & = \tilde A A \end{aligned} \,.

and the Minkowski inner product from prop. has the alternative expression

\begin{aligned} \eta(A,B) & = \tfrac{1}{2}Re(tr(A \tilde B)) \\ & = \tfrac{1}{2} Re(tr(\tilde A B)) \end{aligned} \,.

(Baez-Huerta 09, prop. 5)

Proposition

(special linear group $SL(2,\mathbb{K})$ acts by linear isometries on Minkowski spacetime )

For $\mathbb{K} \in \{\mathbb{R}, \mathbb{C}, \mathbb{H}, \mathbb{O}\}$ one of the four real normed division algebras (prop. ) the special linear group $SL(2,\mathbb{K})$ acts on Minkowski spacetime $\mathbb{R}^{p,1}$ in dimension $p+1 \in \{2+1, \,3+1, \, 5+1. \, 9+1\}$ (def. ) by linear isometries given under the identification with the Pauli matrices in prop./def. by conjugation:

\array{ SL(2,\mathbb{K}) \times \mathbb{R}^{dim(\mathbb{K}+1,1)} & \simeq & SL(2, \mathbb{K}) \times Mat^{herm}_{2 \times 2}(\mathbb{K}) &\overset{}{\longrightarrow}& Mat^{herm}_{2 \times 2}(\mathbb{K}) & \simeq & \mathbb{R}^{dim(\mathbb{K}+1,1)} \\ && (G, A) &\mapsto& G \, A \, G^\dagger }

Proof

For $\mathbb{K} \in \{\mathbb{R}, \mathbb{C}, \mathbb{H}\}$ this is immediate from matrix calculus, but we spell it out now. While the argument does not directly apply to the case $\mathbb{K} = \mathbb{O}$ of the octonions, one can check that it still goes through, too.

First we need to see that the action is well defined. This follows from the associativity of matrix multiplication and the fact that forming conjugate transpose matrices is an antihomomorphism: $(G_1 G_2)^\dagger = G_2^\dagger G_1^\dagger$ . In particular this implies that the action indeed sends hermitian matrices to hermitian matrices:

\begin{aligned} \left( G \, A \, G^\dagger \right)^\dagger & = \underset{= G}{\underbrace{\left( G^\dagger \right)}} \, \underset{= A}{\underbrace{A^\dagger}} \, G^\dagger \\ & = G \, A \, G^\dagger \end{aligned} \,.

By prop./def. such an action is an isometry precisely if it preserves the determinant. This follows from the multiplicative property of determinants: $det(A B) = det(A) det(B)$ and their compativility with conjugate transposition: $det(A^\dagger) = det(A^\ast)$ , and finally by the assumption that $G \in SL(2,\mathbb{K})$ is an element of the special linear group, hence that its determinant is $1 \in \mathbb{K}$ :

\begin{aligned} det\left( G \, A \, G^\dagger \right) & = \underset{ = 1}{\underbrace{det(G)}} \, det(A) \, \underset{= 1^\ast = 1}{\underbrace{det(G^\dagger)}} \\ & = det(A) \end{aligned} \,.

In fact the special linear groups of linear isometries in prop. are the spin groups (def. below) in these dimensions.

exceptional spinors and real normed division algebras

Lorentzian spacetime dimension	$\phantom{AA}$ spin group	normed division algebra	$\,\,$ brane scan entry
$3 = 2+1$	$Spin(2,1) \simeq SL(2,\mathbb{R})$	$\phantom{A}$ $\mathbb{R}$ the real numbers	super 1-brane in 3d
$4 = 3+1$	$Spin(3,1) \simeq SL(2, \mathbb{C})$	$\phantom{A}$ $\mathbb{C}$ the complex numbers	super 2-brane in 4d
$6 = 5+1$	$Spin(5,1) \simeq$ SL(2,H)	$\phantom{A}$ $\mathbb{H}$ the quaternions	little string
$10 = 9+1$	Spin(9,1) ${\simeq}$ “SL(2,O)”	$\phantom{A}$ $\mathbb{O}$ the octonions	heterotic/type II string

This we explain now.

$\,$

Lorentz group and spin group

Definition

(Lorentz group)

For $d \in \mathbb{N}$ , write

O(d-1,1) \hookrightarrow GL(\mathbb{R}^d)

for the subgroup of the general linear group on those linear maps $A$ which preserve this bilinear form on Minkowski spacetime (def ), in that

\eta(A(-),A(-)) = \eta(-,-) \,.

This is the Lorentz group in dimension $d$ .

The elements in the Lorentz group in the image of the special orthogonal group $SO(d-1) \hookrightarrow O(d-1,1)$ are rotations in space. The further elements in the special Lorentz group $SO(d-1,1)$ , which mathematically are “hyperbolic rotations” in a space-time plane, are called boosts in physics.

One distinguishes the following further subgroups of the Lorentz group $O(d-1,1)$ :

the proper Lorentz group

$SO(d-1,1) \hookrightarrow O(d-1,1)$

is the subgroup of elements which have determinant +1 (as elements $SO(d-1,1)\hookrightarrow GL(d)$ of the general linear group);
the proper orthochronous (or restricted) Lorentz group

$SO^+(d-1,1) \hookrightarrow SO(d-1,1)$

is the further subgroup of elements $A$ which preserve the time orientation of vectors $v$ in that $(v^0 \gt 0) \Rightarrow ((A v)^0 \gt 0)$ .

Proposition

(connected component of Lorentz group)

As a smooth manifold, the Lorentz group $O(d-1,1)$ (def. ) has four connected components. The connected component of the identity is the proper orthochronous Lorentz group $SO^+(3,1)$ (def. ). The other three components are

$SO^+(d-1,1)\cdot P$
$SO^+(d-1,1)\cdot T$
$SO^+(d-1,1)\cdot P T$ ,

where, as matrices,

P \coloneqq diag(1,-1,-1, \cdots, -1)

is the operation of point reflection at the origin in space, where

T \coloneqq diag(-1,1,1, \cdots, 1)

is the operation of reflection in time and hence where

P T = T P = diag(-1,-1, \cdots, -1)

is point reflection in spacetime.

The following concept of the Clifford algebra (def. ) of Minkowski spacetime encodes the structure of the inner product space $\mathbb{R}^{d-1,1}$ in terms of algebraic operation (“geometric algebra”), such that the action of the Lorentz group becomes represented by a conjugation action (example below). In particular this means that every element of the proper orthochronous Lorentz group may be “split in half” to yield a double cover: the spin group (def. below).

Definition

(Clifford algebra)

For $d \in \mathbb{N}$ , we write

Cl(\mathbb{R}^{d-1,1})

for the $\mathbb{Z}/2$ -graded associative algebra over $\mathbb{R}$ which is generated from $d$ generators $\{\Gamma_0, \Gamma_1, \Gamma_2, \cdots, \Gamma_{d-1}\}$ in odd degree (“Clifford generators”), subject to the relation

(11)

\Gamma_{a} \Gamma_b + \Gamma_b \Gamma_a = - 2\eta_{a b}

where $\eta$ is the inner product of Minkowski spacetime as in def. .

These relations say equivalently that

\begin{aligned} & \Gamma_0^2 = +1 \\ & \Gamma_i^2 = -1 \;\; \text{for}\; i \in \{1,\cdots, d-1\} \\ & \Gamma_a \Gamma_b = - \Gamma_b \Gamma_a \;\;\; \text{for}\; a \neq b \end{aligned} \,.

We write

\Gamma_{a_1 \cdots a_p} \;\coloneqq\; \frac{1}{p!} \underset{{permutations \atop \sigma}}{\sum} (-1)^{\vert \sigma\vert } \Gamma_{a_{\sigma(1)}} \cdots \Gamma_{a_{\sigma(p)}}

for the antisymmetrized product of $p$ Clifford generators. In particular, if all the $a_i$ are pairwise distinct, then this is simply the plain product of generators

\Gamma_{a_1 \cdots a_n} = \Gamma_{a_1} \cdots \Gamma_{a_n} \;\;\; \text{if} \; \underset{i,j}{\forall} (a_i \neq a_j) \,.

Finally, write

\overline{(-)} \;\colon\; Cl(\mathbb{R}^{d-1,1}) \longrightarrow Cl(\mathbb{R}^{d-1,1})

for the algebra anti-automorphism given by

\overline{\Gamma_a} \coloneqq \Gamma_a

\overline{\Gamma_a \Gamma_b} \coloneqq \Gamma_b \Gamma_a \,.

Remark

(vectors inside Clifford algebra)

By construction, the vector space of linear combinations of the generators in a Clifford algebra $Cl(\mathbb{R}^{d-1,1})$ (def. ) is canonically identified with Minkowski spacetime $\mathbb{R}^{d-1,1}$ (def. )

\widehat{(-)} \;\colon\; \mathbb{R}^{d-1,1} \hookrightarrow Cl(\mathbb{R}^{d-1,1})

via

x_a \mapsto \Gamma_a \,,

hence via

v = v^a x_a \mapsto \hat v = v^a \Gamma_a \,,

such that the defining quadratic form on $\mathbb{R}^{d-1,1}$ is identified with the anti-commutator in the Clifford algebra

\eta(v_1,v_2) = -\tfrac{1}{2}( \hat v_1 \hat v_2 + \hat v_2 \hat v_1) \,,

where on the right we are, in turn, identifying $\mathbb{R}$ with the linear span of the unit in $Cl(\mathbb{R}^{d-1,1})$ .

The key point of the Clifford algebra (def. ) is that it realizes spacetime reflections, rotations and boosts via conjugation actions:

Example

(Clifford conjugation)

For $d \in \mathbb{N}$ and $\mathbb{R}^{d-1,1}$ the Minkowski spacetime of def. , let $v \in \mathbb{R}^{d-1,1}$ be any vector, regarded as an element $\hat v \in Cl(\mathbb{R}^{d-1,1})$ via remark .

Then

the conjugation action $\hat v \mapsto -\Gamma_a^{-1} \hat v \Gamma_a$ of a single Clifford generator $\Gamma_a$ on $\hat v$ sends $v$ to its

reflection at the hyperplane $x_a = 0$ ;

the conjugation action

$\hat v \mapsto \exp(- \tfrac{\alpha}{2} \Gamma_{a b}) \hat v \exp(\tfrac{\alpha}{2} \Gamma_{a b})$

sends $v$ to the result of rotating it in the $(a,b)$ -plane through an angle $\alpha$ .

Proof

This is immediate by inspection:

For the first statement, observe that conjugating the Clifford generator $\Gamma_b$ with $\Gamma_a$ yields $\Gamma_b$ up to a sign, depending on whether $a = b$ or not:

- \Gamma_a^{-1} \Gamma_b \Gamma_a = \left\{ \array{ -\Gamma_b & \vert \text{if}\, a = b \\ \Gamma_b & \vert \text{otherwise} } \right. \,.

Therefore for $\hat v = v^b \Gamma_b$ then $\Gamma_a^{-1} \hat v \Gamma_a$ is the result of multiplying the $a$ -component of $v$ by $-1$ .

For the second statement, observe that

-\tfrac{1}{2}[\Gamma_{a b}, \Gamma_c] = \Gamma_a \eta_{b c} - \Gamma_b \eta_{a c} \,.

This is the canonical action of the Lorentzian special orthogonal Lie algebra $\mathfrak{so}(d-1,1)$ . Hence

\exp(-\tfrac{\alpha}{2} \Gamma_{ab}) \hat v \exp(\tfrac{\alpha}{2} \Gamma_{ab}) = \exp(\tfrac{1}{2}[\Gamma_{a b}, -])(\hat v)

is the rotation action as claimed.

Remark

Since the reflections, rotations and boosts in example are given by conjugation actions, there is a crucial ambiguity in the Clifford elements that induce them:

the conjugation action by $\Gamma_a$ coincides precisely with the conjugation action by $-\Gamma_a$ ;
the conjugation action by $\exp(\tfrac{\alpha}{4} \Gamma_{a b})$ coincides precisely with the conjugation action by $-\exp(\tfrac{\alpha}{2}\Gamma_{a b})$ .

Definition

(spin group)

For $d \in \mathbb{N}$ , the spin group $Spin(d-1,1)$ is the group of even graded elements of the Clifford algebra $Cl(\mathbb{R}^{d-1,1})$ (def. ) which are unitary with respect to the conjugation operation $\overline{(-)}$ from def. :

Spin(d-1,1) \;\coloneqq\; \left\{ A \in Cl(\mathbb{R}^{d-1,1})_{even} \;\vert\; \overline{A} A = 1 \right\} \,.

Proposition

The function

Spin(d-1,1) \longrightarrow GL(\mathbb{R}^{d-1,1})

from the spin group (def. ) to the general linear group in $d$ -dimensions given by sending $A \in Spin(d-1,1) \hookrightarrow Cl(\mathbb{R}^{d-1,1})$ to the conjugation action

\overline{A}(-) A

(via the identification of Minkowski spacetime as the subspace of the Clifford algebra containing the linear combinations of the generators, according to remark )

a group homomorphism onto the proper orthochronous Lorentz group (def. ):

$Spin(d-1,1) \longrightarrow SO^+(d-1,1)$
exhibiting a $\mathbb{Z}/2$ -central extension.

Proof

That the function is a group homomorphism into the general linear group, hence that it acts by linear transformations on the generators follows by using that it clearly lands in automorphisms of the Clifford algebra.

That the function lands in the Lorentz group $O(d-1,1) \hookrightarrow GL(d)$ follows from remark :

\begin{aligned} \eta(\overline{A}v_1A , \overline{A} v_2 A) &= \tfrac{1}{2} \left( \left(\overline{A} \hat v_1 A\right) \left(\overline{A}\hat v_2 A\right) + \left(\overline{A} \hat v_2 A\right) \left(\overline{A} \hat v_1 A\right) \right) \\ & = \tfrac{1}{2} \left( \overline{A}(\hat v_1 \hat v_2 + \hat v_2 \hat v_1) A \right) \\ & = \overline{A} A \tfrac{1}{2}\left( \hat v_1 \hat v_2 + \hat v_2 \hat v_1\right) \\ & = \eta(v_1, v_2) \end{aligned} \,.

That it moreover lands in the proper Lorentz group $SO(d-1,1)$ follows from observing (example ) that every reflection is given by the conjugation action by a linear combination of generators, which are excluded from the group $Spin(d-1,1)$ (as that is defined to be in the even subalgebra).

To see that the homomorphism is surjective, use that all elements of $SO(d-1,1)$ are products of rotations in hyperplanes. If a hyperplane is spanned by the bivector $(\omega^{a b})$ , then such a rotation is given, via example by the conjugation action by

\exp(\tfrac{\alpha}{2} \omega^{a b}\Gamma_{a b})

for some $\alpha$ , hence is in the image.

That the kernel is $\mathbb{Z}/2$ is clear from the fact that the only even Clifford elements which commute with all vectors are the multiples $a \in \mathbb{R} \hookrightarrow Cl(\mathbb{R}^{d-1,1})$ of the identity. For these $\overline{a} = a$ and hence the condition $\overline{a} a = 1$ is equivalent to $a^2 = 1$ . It is clear that these two elements $\{+1,-1\}$ are in the center of $Spin(d-1,1)$ . This kernel reflects the ambiguity from remark .

$\,$

Spinors in dimensions 3, 4, 6 and 10

We now discuss how real spin representations (def. ) in spacetime dimensions 3,4, 6 and 10 are naturally induced from linear algebra over the four real alternative division algebras (prop. ).

Definition

(Clifford algebra via normed division algebra)

Let $\mathbb{K}$ be one of the four real normed division algebras from prop. , hence one of the four real alternative division algebras from prop. .

Define a real linear map

\Gamma \;\colon\; \mathbb{R}^{dim_{\mathbb{R}}(\mathbb{K})+1,1} \longrightarrow End_{\mathbb{R}}(\mathbb{K}^4)

from (the real vector space underlying) Minkowski spacetime to real linear maps on $\mathbb{K}^4$

\Gamma(A) \left( \array{ \psi \\ \phi } \right) \;\coloneqq\; \left( \array{ - \tilde A \phi \\ A \psi } \right) \,.

Here on the right we are using the isomorphism from prop. for identifying a spacetime vector with a $2 \times 2$ -matrix, and we are using the trace reversal $\widetilde(-)$ from def. .

Remark

(Clifford multiplication via octonion-valued matrices)

Each operation of $\Gamma(A)$ in def. is clearly a linear map, even for $\mathbb{K}$ being the non-associative octonions. The only point to beware of is that for $\mathbb{K}$ the octonions, then the composition of two such linear maps is not in general given by the usual matrix product.

Proposition

(real spin representations via normed division algebras)

The map $\Gamma$ in def. gives a representation of the Clifford algebra $Cl(\mathbb{R}^{dim_{\mathbb{R}}(\mathbb{K})+1,1} )$ (this def.), i.e of

$Cl(\mathbb{R}^{2,1})$ for $\mathbb{K} = \mathbb{R}$ ;
$Cl(\mathbb{R}^{3,1})$ for $\mathbb{K} = \mathbb{C}$ ;
$Cl(\mathbb{R}^{5,1})$ for $\mathbb{K} = \mathbb{H}$ ;
$Cl(\mathbb{R}^{9,1})$ for $\mathbb{K} = \mathbb{O}$ .

Hence this Clifford representation induces representations of the spin group $Spin(dim_{\mathbb{R}}(\mathbb{K})+1,1)$ on the real vector spaces

S_{\pm } \coloneqq \mathbb{K}^2 \,.

and hence on

S \coloneqq S_+ \oplus S_- \,.

(Baez-Huerta 09, p. 6)

Proof

We need to check that the Clifford relation

\begin{aligned} (\Gamma(A))^2 & = -\eta(A,A)1 \\ & = + det(A) \end{aligned}

is satisfied (where we used (11) and (8)). Now by definition, for any $(\phi,\psi) \in \mathbb{K}^4$ then

(\Gamma(A))^2 \left( \array{ \phi \\ \psi } \right) \;=\; - \left( \array{ \tilde A(A \phi) \\ A(\tilde A \psi) } \right) \,,

where on the right we have in each component ordinary matrix product expressions.

Now observe that both expressions on the right are sums of triple products that involve either one real factor or two factors that are conjugate to each other:

\begin{aligned} A (\tilde A \psi) & = \left( \array{ x_0 + x_1 & y \\ y^\ast & x_0 - x_1 } \right) \cdot \left( \array{ (-x_0 + x_1) \phi_1 + y \phi_2 \\ y^\ast \phi_1 - (x_0 + x_1)\phi_2 } \right) \\ & = \left( \array{ (-x_0^2 + x_1^2) \phi_1 + (x_0 + x_1)(y \phi_2) + y (y^\ast \phi_1) - y( (x_0 + x_1) \phi_2 ) \\ \cdots } \right) \end{aligned} \,.

Since the associators of triple products that involve a real factor and those involving both an element and its conjugate vanish by prop. (hence ultimately by Artin’s theorem, prop. ). In conclusion all associators involved vanish, so that we may rebracket to obtain

(\Gamma(A))^2 \left( \array{ \phi \\ \psi } \right) \;=\; - \left( \array{ (\tilde A A) \phi \\ (A \tilde A) \psi } \right) \,.

This implies the statement via the equality $-A \tilde A = -\tilde A A = det(A)$ (prop. ).

Proposition

(spinor bilinear pairings)

Let $\mathbb{K}$ be one of the four real normed division algebras and $S_\pm \simeq_{\mathbb{R}}\mathbb{K}^2$ the corresponding spin representation from prop. .

Then there are bilinear maps from two spinors (according to prop. ) to the real numbers

\overline{(-)}(-) \;\colon\; S_+ \otimes S_-\longrightarrow \mathbb{R}

as well as to $\mathbb{R}^{dim(\mathbb{K}+1,1)}$

\overline{(-)}\Gamma (-) \;\colon\; S_\pm \otimes S_{\pm}\longrightarrow \mathbb{R}^{dim(\mathbb{K}+1,1)}

given, respectively, by forming the real part (def. ) of the canonical $\mathbb{K}$ -inner product

\overline{(-)}(-) \colon S_+\otimes S_- \longrightarrow \mathbb{R}

(\psi,\phi)\mapsto \overline{\psi} \phi \coloneqq Re(\psi^\dagger \cdot \phi)

and by forming the product of a column vector with a row vector to produce a matrix, possibly up to trace reversal (def. ) under the identification $\mathbb{R}^{dim(\mathbb{K})+1,1} \simeq Mat^{her}_{2 \times 2}(\mathbb{K})$ from prop. :

S_+ \otimes S_+ \longrightarrow \mathbb{R}^{dim(\mathbb{K})+1,1}

(\psi , \phi) \mapsto \overline{\psi}\Gamma \phi \coloneqq \widetilde{\psi \phi^\dagger + \phi \psi^\dagger}

and

S_- \otimes S_- \longrightarrow \mathbb{R}^{dim(\mathbb{K}+1,1)}

(\psi , \phi) \mapsto {\psi \phi^\dagger + \phi \psi^\dagger}

For $A \in Mat^{her}_{2 \times 2}(\mathbb{K})$ the $A$ -component of this map is

\eta(\overline{\psi}\Gamma \phi, A) = Re (\psi^\dagger (A\phi)) \,.

These pairings have the following properties

both are $Spin(dim(\mathbb{K})+1,1)$ -equivalent;
the pairing $\overline{(-)}\Gamma(-)$ is symmetric:

(12) $\overline{\psi_1} \,\Gamma\, \psi_2 = + \overline{\phi_2}\, \Gamma\, \psi_1 \phantom{AAAA} \text{for} \phantom{AA} \psi_1, \psi_2 \in S_+ \oplus S_-$

(Baez-Huerta 09, prop. 8, prop. 9).

Remark

(two-component spinor notation)

In the physics/QFT literature the expressions for spin representations given by prop. are traditionally written in two-component spinor notation as follows:

An element of $S_+$ is denoted $(\chi_a \in \mathbb{K})_{a = 1,2}$ and called a left handed spinor;
an element of $S_-$ is denoted $(\xi^{\dagger \dot a})_{\dot a = 1,2}$ and called a right handed spinor;
an element of $S = S_+ \oplus S_-$ is denoted

(13) $(\psi^\alpha) = \left( (\chi_a), (\xi^{\dagger \dot a}) \right)$

and called a Dirac spinor;

and the Clifford action of prop. corresponds to the generalized “Pauli matrices”:

a hermitian matrix $A \in Mat^{her}_{2\times 2}(\mathbb{K})$ as in prop regarded as a linear map $S_- \to S_+$ via def. is denoted

$\left(x_\mu \sigma^\mu_{a \dot a}\right) \;\coloneqq\; \left( \array{ x_0 + x_1 & y \\ y^\ast & x_0 - x_1 } \right) \,;$
the negative of the trace-reversal (def. ) of such a hermitian matrix, regarded as a linear map $S_+ \to S_-$ , is denoted

$\left( x_\mu \widetilde \sigma^{\mu \dot a a} \right) \;\coloneqq\; - \left( \array{ -x_0 + x_1 & y \\ y^\ast & -x_0 - x_1 } \right) \,.$
the corresponding Clifford generator $\Gamma(A) \;\colon\; S_+ \oplus S_- \to S_+ \oplus S_-$ (def. ) is denoted

$x_\mu (\gamma^\mu)_{\alpha \beta} \;\coloneqq\; \left( \array{ 0 & x_\mu \sigma^\mu_{a \dot b} \\ x_\mu \widetilde \sigma^{\mu \dot a b} } \right)$
the bilinear spinor-to-vector pairing from prop. is written as the matrix multiplication

$\left( \overline{\psi} \, \gamma^\mu \, \phi\right) \;\coloneqq\; \overline{\psi}\,\Gamma \,\phi \,,$

where the Dirac conjugate $\overline{\psi}$ on the left is given on $(\psi_\alpha) = (\chi_a, \xi^{\dagger \dot c})$ by

(14) $\begin{aligned} \overline{\psi} & \coloneqq \psi^\dagger \gamma^0 \\ & = ( \xi^a, \chi^\dagger_{\dot a} ) \end{aligned}$

hence, with (13):

(15) $\begin{aligned} \overline{\psi_1} \,\gamma^\mu\, \psi_2 & = \psi_1^\dagger \, \gamma^0 \gamma^\mu \, \psi_2 \\ & = (\xi_1)^a \, \sigma^\mu_{a \dot c}\, (\xi_2)^{\dagger \dot c} + (\chi_1)^\dagger_{\dot a} \, \widetilde \sigma^{\mu \dot a c} \, (\chi_2)_c \end{aligned}$

Finally, it is common to abbreviate contractions with the Clifford algebra generators $(\gamma^\mu)$ by a slash, as in

k\!\!\!/\, \;\coloneqq\; \gamma^\mu k_\mu

(16)

i \partial\!\!\!/\, \;\coloneqq\; i \gamma^\mu \frac{\partial}{\partial x^\mu} \,.

This is called the Feynman slash notation.

(e.g. Dermisek I-8, Dermisek I-9)

Below we spell out the example of the Lagrangian field theory of the Dirac field in detail (example ). For discussion of massive chiral spinor fields one also needs the following, here we just mention this for completeness:

Proposition

(chiral spinor mass pairing)

In dimension 2+1 and 3+1, there exists a non-trivial skew-symmetric pairing

\epsilon \;\colon\; S \wedge S \longrightarrow \mathbb{R}

which may be normalized such that in the two-component spinor basis of remark we have

(17)

\tilde \sigma^{\mu \dot a a} = \epsilon^{a b} \epsilon^{\dot a \dot b} \sigma^\mu_{b \dot b} \,.

Proof

Take the non-vanishing components of $\epsilon$ to be

\epsilon^{1 2} = \epsilon^{\dot 1 \dot 2} = \epsilon_{21} = \epsilon_{\dot 2 \dot 1} = 1

and

\epsilon^{2 1} = \epsilon^{\dot 2 \dot 1} = \epsilon_{1 2} = \epsilon_{\dot 1 \dot 2} = -1 \,.

With this equation (17) is checked explicitly. It is clear that $\epsilon$ thus defined is skew symmetric as long as the component algebra is commutative, which is the case for $\mathbb{K}$ being $\mathbb{R}$ or $\mathbb{C}$ .

$\,$

Causal structure

We need to consider the following concepts and constructions related to the causal structure of Minkowski spacetime $\Sigma$ (def. ).

Definition

(spacelike, timelike, lightlike directions; past and future)

Given two points $x,y \in \Sigma$ in Minkowski spacetime (def. ), write

v \coloneqq y - x \in \mathbb{R}^{p,1}

for their difference, using the vector space structure underlying Minkowski spacetime.

Recall the Minkowski inner product $\eta$ on $\mathbb{R}^{p,1}$ , given by prop./def. . Then via remark we say that the difference vector $v$ is

spacelike if $\eta(v,v) \gt 0$ ,
timelike if $\eta(v,v) \lt 0$ ,
lightlike if $\eta(v,v) = 0$ .

If $v$ is timelike or lightlike then we say that

$y$ is in the future of $x$ if $y^0 - x^0 \geq 0$ ;
$y$ is in the past of $x$ if $y^0 - x^0 \leq 0$ .

Definition

(causal cones)

For $x \in \Sigma$ a point in spacetime (an event), we write

V^+(x), V^-(x) \subset \Sigma

for the subsets of events that are in the timelike future or in the timelike past of $x$ , respectively (def. ) called the open future cone and open past cone, respectively, and

\overline{V}^+(x), \overline{V}^-(x) \subset \Sigma

for the subsets of events that are in the timelike or lightlike future or past, respectivel, called the closed future cone and closed past cone, respectively.

The union

J(x) \coloneqq \overline{V}^+(x) \cup \overline{V}^-(x)

of the closed future cone and past cone is called the full causal cone of the event $x$ . Its boundary is the light cone.

More generally for $S \subset \Sigma$ a subset of events we write

\overline{V}^\pm(S) \;\coloneqq\; \underset{x \in S}{\cup} \overline{V}^{\pm}(x)

for the union of the future/past closed cones of all events in the subset.

Definition

(compactly sourced causal support)

Consider a vector bundle $E \overset{}{\to} \Sigma$ (def. ) over Minkowski spacetime (def. ).

Write $\Gamma_{\Sigma}(E)$ for the spaces of smooth sections (def. ), and write

\begin{aligned} \Gamma_{cp}(E) & \,\text{compact support} \\ \Gamma_{\Sigma,\pm cp}(E) & \,\text{compactly sourced future/past support} \\ \Gamma_{\Sigma,scp}(E) & \,\text{spacelike compact support} \\ \Gamma_{\Sigma,(f/p)cp}(E) & \,\text{future/past compact support} \\ \Gamma_{\Sigma,tcp}(E) & \,\text{timelike compact support} \end{aligned}

for the subsets on those smooth sections whose support is

( $cp$ ) inside a compact subset,
( $\pm cp$ ) inside the closed future cone/closed past cone, respectively, of a compact subset,
( $scp$ ) inside the closed causal cone of a compact subset, which equivalently means that the intersection with every (spacelike) Cauchy surface is compact (Sanders 13, theorem 2.2),
( $fcp$ ) inside the past of a Cauchy surface (Sanders 13, def. 3.2),
( $pcp$ ) inside the future of a Cauchy surface (Sanders 13, def. 3.2),
( $tcp$ ) inside the future of one Cauchy surface and the past of another (Sanders 13, def. 3.2).

(Bär 14, section 1, Khavkine 14, def. 2.1)

Definition

(causal order)

Consider the relation on the set $P(\Sigma)$ of subsets of spacetime which says a subset $S_1 \subset \Sigma$ is not prior to a subset $S_2 \subset \Sigma$ , denoted $S_1 {\vee\!\!\!\wedge} S_2$ , if $S_1$ does not intersect the causal past of $S_2$ (def. ), or equivalently that $S_2$ does not intersect the causal future of $S_1$ :

\begin{aligned} S_1 {\vee\!\!\!\wedge} S_2 & \;\;\coloneqq\;\; S_1 \cap \overline{V}^-(S_2) = \emptyset \\ & \;\;\Leftrightarrow\;\; S_2 \cap \overline{V}^+(S_1) = \emptyset \end{aligned} \,.

(Beware that this is just a relation, not an ordering, since it is not relation.)

If $S_1 {\vee\!\!\!\wedge} S_2$ and $S_2 {\vee\!\!\!\wedge} S_1$ we say that the two subsets are spacelike separated and write

S_1 {\gt\!\!\!\!\lt} S_2 \;\;\;\coloneqq\;\;\; S_1 {\vee\!\!\!\wedge} S_2 \;\text{and}\; S_2 {\vee\!\!\!\wedge} S_1 \,.

Definition

(causal complement and causal closure of subset of spacetime)

For $S \subset X$ a subset of spacetime, its causal complement $S^\perp$ is the complement of the causal cone:

S^\perp \;\coloneqq\; S \setminus J_X(S) \,.

The causal complement $S^{\perp \perp}$ of the causal complement $S^\perp$ is called the causal closure. If

S = S^{\perp \perp}

then the subset $S$ is called a causally closed subset.

Given a spacetime $\Sigma$ , we write

CausClsdSubsets(\Sigma) \;\in\; Cat

for the partially ordered set of causally closed subsets, partially ordered by inclusion $\mathcal{O}_1 \subset \mathcal{O}_2$ .

Definition

(adiabatic switching)

For a causally closed subset $\mathcal{O} \subset \Sigma$ of spacetime (def. ) say that an adiabatic switching function or infrared cutoff function for $\mathcal{O}$ is a smooth function $g_{sw}$ of compact support (a bump function) whose restriction to some neighbourhood $U$ of $\mathcal{O}$ is the constant function with value $1$ :

Cutoffs(\mathcal{O}) \;\coloneqq\; \left\{ g_{sw} \in C^\infty_c(\Sigma) \;\vert\; \underset{ {U \supset \mathcal{O}} \atop { \text{neighbourhood} } }{\exists} \left( g_{sw}\vert_U = 1 \right) \right\} \,.

Often we consider the vector space space $C^\infty(\Sigma)\langle g \rangle$ spanned by a formal variable $g$ (the coupling constant) under multiplication with smooth functions, and consider as adiabatic switching functions the corresponding images in this space,

\array{ C_c^\infty(\Sigma) &\overset{\simeq}{\longrightarrow}& C_c^\infty(X)\langle g\rangle }

which are thus bump functions constant over a neighbourhood $U$ of $\mathcal{O}$ not on 1 but on the formal parameter $g$ :

g_{sw}\vert_U = g \,

In this sense we may think of the adiabatic switching as being the spacetime-depependent coupling “constant”.

The following lemma will be key in the derivation (proof of prop. below) of the causal locality of algebra of quantum observables in perturbative quantum field theory:

Lemma

(causal partition)

Let $\mathcal{O} \subset \Sigma$ be a causally closed subset (def. ) and let $f \in C^\infty_{cp}(\Sigma)$ be a compactly supported smooth function which vanishes on a neighbourhood $U \supset \mathcal{O}$ , i.e. $f\vert_U = 0$ .

Then there exists a causal partition of $f$ in that there exist compactly supported smooth functions $a,r \in C^\infty_{cp}(\Sigma)$ such that

they sum up to $f$ :

$f = a + r$
their support satisfies the following causal ordering (def. )

$supp(a) {\vee\!\!\!\wedge} \mathcal{O} {\vee\!\!\!\wedge} supp(r) \,.$

Proof idea

By assumption $\mathcal{O}$ has a Cauchy surface. This may be extended to a Cauchy surface $\Sigma_p$ of $\Sigma$ , such that this is one leaf of a foliation of $\Sigma$ by Cauchy surfaces, given by a diffeomorphism $\Sigma \simeq (-1,1) \times \Sigma_p$ with the original $\Sigma_p$ at zero. There exists then $\epsilon \in (0,1)$ such that the restriction of $supp(f)$ to the interval $(-\epsilon, \epsilon)$ is in the causal complement $\overline{\mathcal{O}}$ of the given region (def. ):

supp(f) \cap (-\epsilon, \epsilon) \times \Sigma_p \;\subset\; \overline{\mathcal{O}} \,.

Let then $\chi \colon \Sigma \to \mathbb{R}$ be any smooth function with

$\chi\vert_{(-1,0] \times \Sigma_p} = 1$
$\chi\vert_{(\epsilon,1) \times \Sigma_p} = 0$ .

Then

r \coloneqq \chi \cdot f \phantom{AAA} \text{and} \phantom{AAA} a \coloneqq (1-\chi) \cdot f

are smooth functions as required.

$\,$

This concludes our discussion of spin and spacetime. In the next chapter we consider the concept of fields on spacetime.

Fields

In this chapter we discuss these topics:

Field bundles
Spaces of field histories
Infinitesimal geometry
Fermion fields and Supergeometry

$\,$

A field history on a given spacetime $\Sigma$ (a history of spatial field configurations, see remark below) is a quantity assigned to each point of spacetime (each event), such that this assignment varies smoothly with spacetime points. For instance an electromagnetic field history (example below) is at each point of spacetime a collection of vectors that encode the direction in which a charged particle passing through that point would feel a force (the “Lorentz force”, see example below).

This is readily formalized (def. below): If $F$ denotes the smooth manifold of “values” that the given kind of field may take at any spacetime point, then a field history $\Phi$ is modeled as a smooth function from spacetime to this space of values:

\Phi \;\colon\; \Sigma \longrightarrow F \,.

It will be useful to unify spacetime and the space of field values (the field fiber) into a single manifold, the Cartesian product

E \;\coloneqq\; \Sigma \times F

and to think of this equipped with the projection map onto the first factor as a fiber bundle of spaces of field values over spacetime

\array{ E &\coloneqq& \Sigma \times F \\ {}^{\mathllap{fb}}\downarrow & \swarrow_{\mathrlap{pr_1}} \\ \Sigma } \,.

This is then called the field bundle, which specifies the kind of values that the given field species may take at any point of spacetime. Since the space $F$ of field values is the fiber of this fiber bundle (def. ), it is sometimes also called the field fiber. (See also at fiber bundles in physics.)

Given a field bundle $E \overset{fb}{\to}\Sigma$ , then a field history is a section of that bundle (def. ). The discussion of field theory concerns the space of all possible field histories, hence the space of sections of the field bundle (example below). This is a very “large” generalized smooth space, called a diffeological space (def. below).

Or rather, in the presence of fermion fields such as the Dirac field (example below), the Pauli exclusion principle demands that the field bundle is a super-manifold, and that the fermionic space of field histories (example below) is a super-geometric generalized smooth space: a super smooth set (def. below).

This smooth structure on the space of field histories will be crucial when we discuss observables of a field theory below, because these are smooth functions on the space of field histories. In particular it is this smooth structure which allows to derive that linear observables of a free field theory are given by distributions (prop. ) below. Among these are the point evaluation observables (delta distributions) which are traditionally denoted by the same symbol as the field histories themselves.

Hence there are these aspects of the concept of “field” in physics, which are closely related, but crucially different:

$\,$

aspects of the concept of fields

aspect	term	type	description	def.
field component	$\phi^a$ , $\phi^a_{,\mu}$	$J^\infty_\Sigma(E) \to \mathbb{R}$	coordinate function on jet bundle of field bundle	def. , def.
field history	$\Phi$ , $\frac{\partial \Phi}{\partial x^\mu}$	$\Sigma \to J^\infty_\Sigma(E)$	jet prolongation of section of field bundle	def. , def.
field observable	$\mathbf{\Phi}^a(x)$ , $\partial_{\mu} \mathbf{\Phi}^a(x),$	$\Gamma_{\Sigma}(E) \to \mathbb{R}$	derivatives of delta-functional on space of sections	def. , example
averaging of field observable	$\alpha^\ast \mapsto \underset{\Sigma}{\int} \alpha^\ast_a(x) \mathbf{\Phi}^a(x) \, dvol_\Sigma(x)$	$\Gamma_{\Sigma,cp}(E^\ast) \to Obs(E_{scp},\mathbf{L})$	observable-valued distribution	def.
algebra of quantum observables	$\left( Obs(E,\mathbf{L})_{\mu c},\, \star\right)$	$\mathbb{C}Alg$	non-commutative algebra structure on field observables	def. , def.

$\,$

field bundles

Definition

(fields and field histories)

Given a spacetime $\Sigma$ , then a type of fields on $\Sigma$ is a smooth fiber bundle (def. )

\array{E \\ \downarrow^{\mathrlap{fb}} \\ \Sigma }

called the field bundle,

Given a type of fields on $\Sigma$ this way, then a field history of that type on $\Sigma$ is a term of that type, hence is a smooth section (def. ) of this bundle, namely a smooth function of the form

\Phi \;\colon\; \Sigma \longrightarrow E

such that composed with the projection map it is the identity function, i.e. such that

fb \circ \Phi = id \phantom{AAAAAAA} \array{ && E \\ & {}^{\mathllap{\Phi}}\nearrow & \downarrow^{\mathrlap{fb}} \\ \Sigma & = & \Sigma } \,.

The set of such sections/field histories is to be denoted

(18)

\Gamma_\Sigma(E) \;\coloneqq\; \left\{ \array{ && E \\ & {}^{\mathllap{\Phi}}\nearrow & \downarrow^{\mathrlap{fb}} \\ \Sigma &=& \Sigma } \phantom{fb} \right\}

Remark

(field histories are histories of spatial field configurations)

Given a section $\Phi \in \Gamma_\Sigma(E)$ of the field bundle (def. ) and given a spacelike (def. ) submanifold $\Sigma_p \hookrightarrow \Sigma$ (def. ) of spacetime in codimension 1, then the restriction $\Phi\vert_{\Sigma_p}$ of $\Phi$ to $\Sigma_p$ may be thought of as a field configuration in space. As different spatial slices $\Sigma_p$ are chosen, one obtains such field configurations at different times. It is in this sense that the entirety of a section $\Phi \in \Gamma_\Sigma(E)$ is a history of field configurations, hence a field history (def ).

Remark

(possible field histories)

After we give the set $\Gamma_\Sigma(E)$ of field histories (18) differential geometric structure, below in example and example , we call it the space of field histories. This should be read as space of possible field histories; containing all field histories that qualify as being of the type specified by the field bundle $E$ .

After we obtain equations of motion below in def. , these serve as the “laws of nature” that field histories should obey, and they define the subspace of those field histories that do solve the equations of motion; this will be denoted

\Gamma_\Sigma(E)_{\delta_{EL}\mathbf{L}= 0} \overset{\phantom{AAA}}{\hookrightarrow} \Gamma_\Sigma(E)

and called the on-shell space of field histories (41).

For the time being, not to get distracted from the basic idea of quantum field theory, we will focus on the following simple special case of field bundles:

Example

(trivial vector bundle as a field bundle)

In applications the field fiber $F = V$ is often a finite dimensional vector space. In this case the trivial field bundle with fiber $F$ is of course a trivial vector bundle (def. ).

Choosing any linear basis $(\phi^a)_{a = 1}^s$ of the field fiber, then over Minkowski spacetime (def. ) we have canonical coordinates on the total space of the field bundle

( (x^\mu), ( \phi^a ) ) \,,

where the index $\mu$ ranges from $0$ to $p$ , while the index $a$ ranges from 1 to $s$ .

If this trivial vector bundle is regarded as a field bundle according to def. , then a field history $\Phi$ is equivalently an $s$ -tuple of real-valued smooth functions $\Phi^a \colon \Sigma \to \mathbb{R}$ on spacetime:

\Phi = ( \Phi^a )_{a = 1}^s \,.

Example

(field bundle for real scalar field)

If $\Sigma$ is a spacetime and if the field fiber

F \coloneqq \mathbb{R}

is simply the real line, then the corresponding trivial field bundle (def. )

\array{ \Sigma \times \mathbb{R} \\ {}^{\mathllap{pr_1}}\downarrow \\ \Sigma }

is the trivial real line bundle (a special case of example ) and the corresponding field type (def. ) is called the real scalar field on $\Sigma$ . A configuration of this field is simply a smooth function on $\Sigma$ with values in the real numbers:

(19)

\Gamma_\Sigma(\Sigma \times \mathbb{R}) \;\simeq\; C^\infty(\Sigma) \,.

Example

(field bundle for electromagnetic field)

On Minkowski spacetime $\Sigma$ (def. ), let the field bundle (def. ) be given by the cotangent bundle

E \coloneqq T^\ast \Sigma \,.

This is a trivial vector bundle (example ) with canonical field coordinates $(a_\mu)$ .

A section of this bundle, hence a field history, is a differential 1-form

A \in \Gamma_\Sigma(T^\ast \Sigma) = \Omega^1(\Sigma)

on spacetime (def. ). Interpreted as a field history of the electromagnetic field on $\Sigma$ , this is often called the vector potential. Then the de Rham differential (def. ) of the vector potential is a differential 2-form

F \coloneqq d A

known as the Faraday tensor. In the canonical coordinate basis 2-forms this may be expanded as

(20)

F = \underoverset{i = 1}{p}{\sum} E_i d x^0 \wedge d x^i + \underset{1 \leq i \lt j \leq p}{\sum} B_{i j} d x^i \wedge d x^j \,.

Here $(E_i)_{i = 1}^p$ are called the components of the electric field, while $(B_{i j})$ are called the components of the magnetic field.

Example

(field bundle for Yang-Mills field over Minkowski spacetime)

Let $\mathfrak{g}$ be a Lie algebra of finite dimension with linear basis $(t_\alpha)$ , in terms of which the Lie bracket is given by

(21)

[t_\alpha, t_\beta] \;=\; \gamma^\gamma{}_{\alpha \beta} t_\gamma \,.

Over Minkowski spacetime $\Sigma$ (def. ), consider then the field bundle which is the cotangent bundle tensored with the Lie algebra $\mathfrak{g}$

E \coloneqq T^\ast \Sigma \otimes \mathfrak{g} \,.

This is the trivial vector bundle (example ) with induced field coordinates

( a_\mu^\alpha ) \,.

A section of this bundle is a Lie algebra-valued differential 1-form

A \in \Gamma_\Sigma(T^\ast \Sigma \otimes \mathfrak{g}) = \Omega^1(\Sigma, \mathfrak{g}) \,.

with components

A^\ast(a_\mu^\alpha) = A^\alpha_\mu \,.

This is called a field history for Yang-Mills gauge theory (at least if $\mathfrak{g}$ is a semisimple Lie algebra, see example below).

For $\mathfrak{g} = \mathbb{R}$ is the line Lie algebra, this reduces to the case of the electromagnetic field (example ).

For $\mathfrak{g} = \mathfrak{su}(3)$ this is a field history for the gauge field of the strong nuclear force in quantum chromodynamics.

For readers familiar with the concepts of principal bundles and connections on a bundle we include the following example which generalizes the Yang-Mills field over Minkowski spacetime from example to the situation over general spacetimes.

Example

(general Yang-Mills field in fixed topological sector)

Let $\Sigma$ be any spacetime manifold and let $G$ be a compact Lie group with Lie algebra denoted $\mathfrak{g}$ . Let $P \overset{is}{\to} \Sigma$ be a $G$ -principal bundle and $\nabla_0$ a chosen connection on it, to be called the background $G$ -Yang-Mills field.

Then the field bundle (def. ) for $G$ -Yang-Mills theory in the topological sector $P$ is the tensor product of vector bundles

E \coloneqq \left(P \times^{ad}_G \mathfrak{g}\right) \otimes_\Sigma \left( T^\ast \Sigma \right)

of the adjoint bundle of $P$ and the cotangent bundle of $\Sigma$ .

With the choice of $\nabla_0$ , every (other) connection $\nabla$ on $P$ uniquely decomposes as

\nabla = \nabla_0 + A \,,

where

A \in \Gamma_\Sigma(E)

is a section of the above field bundle, hence a Yang-Mills field history.

The electromagnetic field (def. ) and the Yang-Mills field (def. , def. ) with differential 1-forms as field histories are the basic examples of gauge fields (we consider this in more detail below in Gauge symmetries). There are also higher gauge fields with differential n-forms as field histories:

Example

(field bundle for B-field)

On Minkowski spacetime $\Sigma$ (def. ), let the field bundle (def. ) be given by the skew-symmetrized tensor product of vector bundles of the cotangent bundle with itself

E \coloneqq \wedge^2_\Sigma T^\ast \Sigma \,.

This is a trivial vector bundle (example ) with canonical field coordinates $(b_{\mu \nu})$ subject to

b_{\mu \nu} \;=\; - b_{\nu \mu} \,.

A section of this bundle, hence a field history, is a differential 2-form (def. )

B \in \Gamma_\Sigma(\wedge^2_\Sigma T^\ast \Sigma) = \Omega^2(\Sigma)

on spacetime.

$\,$

space of field histories

Given any field bundle, we will eventually need to regard the set of all field histories $\Gamma_\Sigma(E)$ as a “smooth set” itself, a smooth space of sections, to which constructions of differential geometry apply (such as for the discussion of observables and states below ). Notably we need to be talking about differential forms on $\Gamma_\Sigma(E)$ .

However, a space of sections $\Gamma_\Sigma(E)$ does not in general carry the structure of a smooth manifold; and it carries the correct smooth structure of an infinite dimensional manifold only if $\Sigma$ is a compact space (see at manifold structure of mapping spaces). Even if it does carry infinite dimensional manifold structure, inspection shows that this is more structure than actually needed for the discussion of field theory. Namely it turns out below that all we need to know is what counts as a smooth family of sections/field histories, hence which functions of sets

\Phi_{(-)} \;\colon\; \mathbb{R}^n \longrightarrow \Gamma_\Sigma(E)

from any Cartesian space $\mathbb{R}^n$ (def. ) into $\Gamma_\Sigma(E)$ count as smooth functions, subject to some basic consistency condition on this choice.

This structure on $\Gamma_\Sigma(E)$ is called the structure of a diffeological space:

Definition

(diffeological space)

A diffeological space $X$ is

a set $X_s \in$ Set;
for each $n \in \mathbb{N}$ a choice of subset

$X(\mathbb{R}^n) \subset Hom_{Set}(\mathbb{R}^n_s, X_s) = \left\{ \mathbb{R}^n_s \to X_s \right\}$

of the set of functions from the underlying set $\mathbb{R}^n_s$ of $\mathbb{R}^n$ to $X_s$ , to be called the smooth functions or plots from $\mathbb{R}^n$ to $X$ ;
for each smooth function $f \;\colon\; \mathbb{R}^{n_1} \longrightarrow \mathbb{R}^{n_2}$ between Cartesian spaces (def. ) a choice of function

$f^\ast \;\colon\; X(\mathbb{R}^{n_2}) \longrightarrow X(\mathbb{R}^{n_1})$

to be thought of as the precomposition operation

$\left( \mathbb{R}^{n_2} \overset{\Phi}{\longrightarrow} X \right) \;\overset{f^\ast}{\mapsto}\; \left( \mathbb{R}^{n_1} \overset{f}{\to} \mathbb{R}^{n_2} \overset{\Phi}{\to} X \right)$

such that

(constant functions are smooth)

$X(\mathbb{R}^0) = X_s \,,$
(functoriality)
1. If $id_{\mathbb{R}^n} \;\colon\; \mathbb{R}^n \to \mathbb{R}^n$ is the identity function on $\mathbb{R}^n$ , then $\left(id_{\mathbb{R}^n}\right)^\ast \;\colon\; X(\mathbb{R}^n) \to X(\mathbb{R}^n)$ is the identity function on the set of plots $X(\mathbb{R}^n)$ ;
2. If $\mathbb{R}^{n_1} \overset{f}{\to} \mathbb{R}^{n_2} \overset{g}{\to} \mathbb{R}^{n_3}$ are two composable smooth functions between Cartesian spaces (def. ), then pullback of plots along them consecutively equals the pullback along the composition:
  
  $f^\ast \circ g^\ast = (g \circ f)^\ast$
  
  i.e.
  
  $\array{ && X(\mathbb{R}^{n_2}) \\ & {}^{\mathllap{f^\ast}}\swarrow && \nwarrow^{\mathrlap{g^\ast}} \\ X(\mathbb{R}^{n_1}) && \underset{ (g \circ f)^\ast }{\longleftarrow} && X(\mathbb{R}^{n_3}) }$
(gluing)

If $\{ U_i \overset{f_i}{\to} \mathbb{R}^n\}_{i \in I}$ is a differentiably good open cover of a Cartesian space (def. ) then the function which restricts $\mathbb{R}^n$ -plots of $X$ to a set of $U_i$ -plots

$X(\mathbb{R}^n) \overset{( (f_i)^\ast )_{i \in I} }{\hookrightarrow} \underset{i \in I}{\prod} X(U_i)$

is a bijection onto the set of those tuples $(\Phi_i \in X(U_i))_{i \in I}$ of plots, which are “matching families” in that they agree on intersections:

$\phi_i\vert_{U_i \cap U_j} = \phi_j \vert_{U_i \cap U_j} \phantom{AAAAAA} \array{ && U_i \cap U_j \\ & \swarrow && \searrow \\ U_i && && U_j \\ & {}_{\mathrlap{\Phi_i}}\searrow && \swarrow_{\mathrlap{\Phi_j}} \\ && X }$

Finally, given $X_1$ and $X_2$ two diffeological spaces, then a smooth function between them

f \;\colon\; X_1 \longrightarrow X_2

a function of the underlying sets

$f_s \;\colon\; (X_1)_s \longrightarrow (X_2)_s$

such that

for $\Phi \in X(\mathbb{R}^n)$ a plot of $X_1$ , then the composition $f_s \circ \Phi_s$ is a plot $f_\ast(\Phi)$ of $X_2$ :

$\array{ && \mathbb{R}^n \\ & {}^{\mathllap{\Phi}}\swarrow && \searrow^{\mathrlap{f_\ast(\Phi)}} \\ X_1 && \underset{f}{\longrightarrow} && X_2 } \,.$

(Stated more abstractly, this says simply that diffeological spaces are the concrete sheaves on the site of Cartesian spaces from def. .)

For more background on diffeological spaces see also geometry of physics – smooth sets.

Example

(Cartesian spaces are diffeological spaces)

Let $X$ be a Cartesian space (def. ) Then it becomes a diffeological space (def. ) by declaring its plots $\Phi \in X(\mathbb{R}^n)$ to the ordinary smooth functions $\Phi \colon \mathbb{R}^n \to X$ .

Under this identification, a function $f \;\colon\; (X_1)_s \to (X_2)_s$ between the underlying sets of two Cartesian spaces is a smooth function in the ordinary sense precisely if it is a smooth function in the sense of diffeological spaces.

Stated more abstractly, this statement is an example of the Yoneda embedding over a subcanonical site.

More generally, the same construction makes every smooth manifold a smooth set.

Example

(diffeological space of field histories)

Let $E \overset{fb}{\to} \Sigma$ be a smooth field bundle (def. ). Then the set $\Gamma_\Sigma(E)$ of field histories/sections (def. ) becomes a diffeological space (def. )

(22)

\Gamma_\Sigma(E) \in DiffeologicalSpaces

by declaring that a smooth family $\Phi_{(-)}$ of field histories, parameterized over any Cartesian space $U$ is a smooth function out of the Cartesian product manifold of $\Sigma$ with $U$

\array{ U \times \Sigma &\overset{\Phi_{(-)}(-)}{\longrightarrow}& E \\ (u,x) &\mapsto& \Phi_u(x) }

such that for each $u \in U$ we have $p \circ \Phi_{u}(-) = id_\Sigma$ , i.e.

\array{ && E \\ & {}^{\mathllap{\Phi_{(-)}(-)}}\nearrow & \downarrow^{\mathrlap{fb}} \\ U \times \Sigma &\underset{pr_2}{\longrightarrow}& \Sigma } \,.

The following example is included only for readers who wonder how infinite-dimensional manifolds fit in. Since we will never actually use infinite-dimensional manifold-structure, this example is may be ignored.

Example

(Fréchet manifolds are diffeological spaces)

Consider the particular type of infinite-dimensional manifolds called Fréchet manifolds. Since ordinary smooth manifolds $U$ are an example, for $X$ a Fréchet manifold there is a concept of smooth functions $U \to X$ . Hence we may give $X$ the structure of a diffeological space (def. ) by declaring the plots over $U$ to be these smooth functions $U \to X$ , with the evident postcomposition action.

It turns out that then that for $X$ and $Y$ two Fréchet manifolds, there is a natural bijection between the smooth functions $X \to Y$ between them regarded as Fréchet manifolds and [regarded as . Hence it does not matter which of the two perspectives we take (unless of course a more general than a enters the picture, at which point the second definition generalizes, whereas the first does not).]

Stated more abstractly, this means that Fréchet manifolds form a full subcategory of that of diffeological spaces (this prop.):

FrechetManifolds \hookrightarrow DiffeologicalSpaces \,.

If $\Sigma$ is a compact smooth manifold and $E \simeq \Sigma \times F \to \Sigma$ is a trivial fiber bundle with fiber $F$ a smooth manifold, then the set of sections $\Gamma_\Sigma(E)$ carries a standard structure of a Fréchet manifold (see at manifold structure of mapping spaces). Under the above inclusion of Fréchet manifolds into diffeological spaces, this smooth structure agrees with that from example (see this prop.)

Once the step from smooth manifolds to diffeological spaces (def. ) is made, characterizing the smooth structure of the space entirely by how we may probe it by mapping smooth Cartesian spaces into it, it becomes clear that the underlying set $X_s$ of a diffeological space $X$ is not actually crucial to support the concept: The space is already entirely defined structurally by the system of smooth plots it has, and the underlying set $X_s$ is recovered from these as the set of plots from the point $\mathbb{R}^0$ .

This is crucial for field theory: the spaces of field histories of fermionic fields (def. below) such as the Dirac field (example below) do not have underlying sets of points the way diffeological spaces have. Informally, the reason is that a point is a bosonic object, while and the nature of fermionic fields is the opposite of bosonic.

But we may just as well drop the mentioning of the underlying set $X_s$ in the definition of generalized smooth spaces. By simply stripping this requirement off of def. we obtain the following more general and more useful definition (still “bosonic”, though, the supergeometric version is def. below):

Definition

(smooth set)

A smooth set $X$ is

for each $n \in \mathbb{N}$ a choice of set

$X(\mathbb{R}^n) \in Set$

to be called the set of smooth functions or plots from $\mathbb{R}^n$ to $X$ ;
for each smooth function $f \;\colon\; \mathbb{R}^{n_1} \longrightarrow \mathbb{R}^{n_2}$ between Cartesian spaces a choice of function

$f^\ast \;\colon\; X(\mathbb{R}^{n_2}) \longrightarrow X(\mathbb{R}^{n_1})$

to be thought of as the precomposition operation

$\left( \mathbb{R}^{n_2} \overset{\Phi}{\longrightarrow} X \right) \;\overset{f^\ast}{\mapsto}\; \left( \mathbb{R}^{n_1} \overset{f}{\to} \mathbb{R}^{n_2} \overset{\Phi}{\to} X \right)$

such that

(functoriality)
1. If $id_{\mathbb{R}^n} \;\colon\; \mathbb{R}^n \to \mathbb{R}^n$ is the identity function on $\mathbb{R}^n$ , then $\left(id_{\mathbb{R}^n}\right)^\ast \;\colon\; X(\mathbb{R}^n) \to X(\mathbb{R}^n)$ is the identity function on the set of plots $X(\mathbb{R}^n)$ .
2. If $\mathbb{R}^{n_1} \overset{f}{\to} \mathbb{R}^{n_2} \overset{g}{\to} \mathbb{R}^{n_3}$ are two composable smooth functions between Cartesian spaces, then consecutive pullback of plots along them equals the pullback along the composition:
  
  $f^\ast \circ g^\ast = (g \circ f)^\ast$
  
  i.e.
  
  $\array{ && X(\mathbb{R}^{n_2}) \\ & {}^{\mathllap{f^\ast}}\swarrow && \nwarrow^{\mathrlap{g^\ast}} \\ X(\mathbb{R}^{n_1}) && \underset{ (g \circ f)^\ast }{\longleftarrow} && X(\mathbb{R}^{n_3}) }$
(gluing)

If $\{ U_i \overset{f_i}{\to} \mathbb{R}^n\}_{i \in I}$ is a differentiably good open cover of a Cartesian space (def. ) then the function which restricts $\mathbb{R}^n$ -plots of $X$ to a set of $U_i$ -plots

$X(\mathbb{R}^n) \overset{( (f_i)^\ast )_{i \in I} }{\hookrightarrow} \underset{i \in I}{\prod} X(U_i)$

is a bijection onto the set of those tuples $(\Phi_i \in X(U_i))_{i \in I}$ of plots, which are “matching families” in that they agree on intersections:

$\phi_i\vert_{U_i \cap U_j} = \phi_j \vert_{U_i \cap U_j} \phantom{AAAA} \text{i.e.} \phantom{AAAA} \array{ && U_i \cap U_j \\ & \swarrow && \searrow \\ U_i && && U_j \\ & {}_{\mathrlap{\Phi_i}}\searrow && \swarrow_{\mathrlap{\Phi_j}} \\ && X }$

Finally, given $X_1$ and $X_2$ two smooth sets, then a smooth function between them

f \;\colon\; X_1 \longrightarrow X_2

for each $n \in \mathbb{N}$ a function

$f_\ast(\mathbb{R}^n) \;\colon\; X_1(\mathbb{R}^n) \longrightarrow X_2(\mathbb{R}^n)$

such that

for each smooth function $g \colon \mathbb{R}^{n_1} \to \mathbb{R}^{n_2}$ between Cartesian spaces we have

$g^\ast_2 \circ f_\ast(\mathbb{R}^{n_2}) = f_\ast(\mathbb{R}^{n_1}) \circ g^\ast_1 \phantom{AAAAA} \text{i.e.} \phantom{AAAAA} \text{i.e.} \phantom{AAAAA} \array{ X_1(\mathbb{R}^{n_2}) &\overset{f_\ast(\mathbb{R}^{n_2})}{\longrightarrow}& X_2(\mathbb{R}^{n_2}) \\ \mathllap{g_1^\ast}\downarrow && \downarrow\mathrlap{g^\ast_2} \\ X_1(\mathbb{R}^{n_1}) &\underset{f_\ast(\mathbb{R}^{n_1})}{\longrightarrow}& X_2(\mathbb{R}^{n_1}) }$

Stated more abstractly, this simply says that smooth sets are the sheaves on the site of Cartesian spaces from def. .

Basing differential geometry on smooth sets is an instance of the general approach to geometry called functorial geometry or topos theory. For more background on this see at geometry of physics – smooth sets.

First we verify that the concept of smooth sets is a consistent generalization:

Example

(diffeological spaces are smooth sets)

Every diffeological space $X$ (def. ) is a smooth set (def. ) simply by forgetting its underlying set of points and remembering only its sets of plot.

In particular therefore each Cartesian space $\mathbb{R}^n$ is canonically a smooth set by example .

Moreover, given any two diffeological spaces, then the morphisms $f \colon X \to Y$ between them, regarded as diffeological spaces, are the same as the morphisms as smooth sets.

Stated more abstractly, this means that we have full subcategory inclusions

CartesianSpaces \overset{\phantom{AAA}}{\hookrightarrow} DiffeologicalSpaces \overset{\phantom{AAA}}{\hookrightarrow} SmoothSets \,.

Recall, for the next proposition , that in the definition of a smooth set $X$ the sets $X(\mathbb{R}^n)$ are abstract sets which are to be thought of as would-be smooth functions “ $\mathbb{R}^n \to X$ ”. Inside def. this only makes sense in quotation marks, since inside that definition the smooth set $X$ is only being defined, so that inside that definition there is not yet an actual concept of smooth functions of the form “ $\mathbb{R}^n \to X$ ”.

But now that the definition of smooth sets and of morphisms between them has been stated, and seeing that Cartesian space $\mathbb{R}^n$ are examples of smooth sets, by example , there is now an actual concept of smooth functions $\mathbb{R}^n \to X$ , namely as smooth sets. For the concept of smooth sets to be consistent, it ought to be true that this a posteriori concept of smooth functions from Cartesian spaces to smooth sets coincides wth the a priori concept, hence that we “may remove the quotation marks” in the above. The following proposition says that this is indeed the case:

Proposition

(plots of a smooth set really are the smooth functions into the smooth set)

Let $X$ be a smooth set (def. ). For $n \in \mathbb{R}$ , there is a natural function

Hom_{SmoothSet}(\mathbb{R}^n , X) \overset{\phantom{AA}\simeq\phantom{AA}}{\longrightarrow} X(\mathbb{R}^n)

from the set of homomorphisms of smooth sets from $\mathbb{R}^n$ (regarded as a smooth set via example ) to $X$ , to the set of plots of $X$ over $\mathbb{R}^n$ , given by evaluating on the identity plot $id_{\mathbb{R}^n}$ .

This function is a bijection.

This says that the plots of $X$ , which initially bootstrap $X$ into being as declaring the would-be smooth functions into $X$ , end up being the actual smooth functions into $X$ .

Proof

This elementary but profound fact is called the Yoneda lemma, here in its incarnation over the site of Cartesian spaces (def. ).

A key class of examples of smooth sets (def. ) that are not diffeological spaces (def. ) are universal smooth moduli spaces of differential forms:

Example

(universal smooth moduli spaces of differential forms)

For $k \in \mathbb{N}$ there is a smooth set (def. )

\mathbf{\Omega}^k \;\in\; SmoothSet

defined as follows:

for $n \in \mathbb{N}$ the set of plots from $\mathbb{R}^n$ to $\mathbf{\Omega}^k$ is the set of smooth differential k-forms on $\mathbb{R}^n$ (def. )

$\mathbf{\Omega}^k(\mathbb{R}^n) \;\coloneqq\; \Omega^k(\mathbb{R}^n)$
for $f \colon \mathbb{R}^{n_1} \to \mathbb{R}^{n_2}$ a smooth function (def. ) the operation of pullback of plots along $f$ is just the pullback of differential forms $f^\ast$ from prop.

$\array{ \mathbb{R}^{n_1} && \Omega^k(\mathbb{R}^{n_1}) \\ \downarrow^{\mathrlap{f}} && \uparrow^{\mathrlap{f^\ast}} \\ \mathbb{R}^{n_2} && \Omega^k(\mathbb{R}^{n_2}) }$

That this is functorial is just the standard fact (7) from prop. .

For $k = 1$ the smooth set $\mathbf{\Omega}^0$ actually is a diffeological space, in fact under the identification of example this is just the real line:

\mathbf{\Omega}^0 \simeq \mathbb{R}^1 \,.

But for $k \geq 1$ we have that the set of plots on $\mathbb{R}^0 = \ast$ is a singleton

\mathbf{\Omega}^{k \geq 1}(\mathbb{R}^0) \simeq \{0\}

consisting just of the zero differential form. The only diffeological space with this property is $\mathbb{R}^0 = \ast$ itself. But $\mathbf{\Omega}^{k \geq 1}$ is far from being that trivial: even though its would-be underlying set is a single point, for all $n \geq k$ it admits an infinite set of plots. Therefore the smooth sets $\mathbf{\Omega}^k$ for $k \geq$ are not diffeological spaces.

That the smooth set $\mathbf{\Omega}^k$ indeed deserves to be addressed as the universal moduli space of differential k-forms follows from prop. : The universal moduli space of $k$ -forms ought to carry a universal differential $k$ -forms $\omega_{univ} \in \Omega^k(\mathbf{\Omega}^k)$ such that every differential $k$ -form $\omega$ on any $\mathbb{R}^n$ arises as the pullback of differential forms of this universal one along some modulating morphism $f_\omega \colon X \to \mathbf{\Omega}^k$ :

\array{ \{\omega\} &\overset{(f_\omega)^\ast}{\longleftarrow}& \{\omega_{univ}\} \\ \\ X &\underset{f_\omega}{\longrightarrow}& \mathbf{\Omega}^k }

But with prop. this is precisely what the definition of the plots of $\mathbf{\Omega}^k$ says.

Similarly, all the usual operations on differential form now have their universal archetype on the universal moduli spaces of differential forms

In particular, for $k \in \mathbb{N}$ there is a canonical morphism of smooth sets of the form

\mathbf{\Omega}^k \overset{\mathbf{d}}{\longrightarrow} \mathbf{\Omega}^{k+1}

defined over $\mathbb{R}^n$ by the ordinary de Rham differential (def. )

(23)

\Omega^k(\mathbb{R}^n) \overset{d}{\longrightarrow} \Omega^{k+1}(\mathbb{R}^n) \,.

That this satisfies the compatibility with precomposition of plots

\array{ \mathbb{R}^{n_1} && \Omega^k(\mathbb{R}^{n_1}) &\overset{d}{\longrightarrow}& \Omega^{k+1}(\mathbb{R}^{n_1}) \\ {}^{\mathllap{f}}\downarrow && \uparrow^{\mathrlap{f^\ast}} && \uparrow^{\mathrlap{f^\ast}} \\ \mathbb{R}^{n_2} && \Omega^k(\mathbb{R}^{n_2}) &\underset{d}{\longrightarrow}& \Omega^k( \mathbb{R}^{n_2} ) }

is just the compatibility of pullback of differential forms with the de Rham differential of from prop. .

The upshot is that we now have a good definition of differential forms on any diffeological space and more generally on any smooth set:

Definition

(differential forms on smooth sets)

Let $X$ be a diffeological space (def. ) or more generally a smooth set (def. ) then a differential k-form $\omega$ on $X$ is equivalently a morphism of smooth sets

X \longrightarrow \mathbf{\Omega}^k

from $X$ to the universal smooth moduli space of differential froms from example .

Concretely, by unwinding the definitions of $\mathbf{\Omega}^k$ and of morphisms of smooth sets, this means that such a differential form is:

for each $n \in \mathbb{N}$ and each plot $\mathbb{R}^n \overset{\Phi}{\to} X$ an ordinary differential form

$\Phi^\ast(\omega) \in \Omega^\bullet(\mathbb{R}^n)$

such that

for each smooth function $f \;\colon\; \mathbb{R}^{n_1} \to \mathbb{R}^{n_2}$ between Cartesian spaces the ordinary pullback of differential forms along $f$ is compatible with these choices, in that for every plot $\mathbb{R}^{n_2} \overset{\Phi}{\to} X$ we have

$f^\ast\left(\Phi^\ast(\omega)\right) = ( f^\ast \Phi )^\ast(\omega)$

i.e.

$\array{ \mathbb{R}^{n_1} && \overset{f}{\longrightarrow} && \mathbb{R}^{n_2} \\ & {}_{\mathllap{f^\ast \Phi}}\searrow && \swarrow_{\mathrlap{\Phi}} \\ && X } \phantom{AAAA} \array{ \Omega^\bullet( \mathbb{R}^{n_1} ) && \overset{f^\ast}{\longleftarrow} && \Omega^\bullet(\mathbb{R}^{n_2}) \\ & {}_{\mathllap{(f^\ast \Phi)^\ast}}\nwarrow && \nearrow_{\mathrlap{\Phi^\ast}} \\ && \Omega^\bullet(X) } \,.$

We write $\Omega^\bullet(X)$ for the set of differential forms on the smooth set $X$ defined this way.

Moreover, given a differential k-form

X \overset{\omega}{\longrightarrow} \mathbf{\Omega}^k

on a smooth set $X$ this way, then its de Rham differential $d \omega \in \Omega^{k+1}(X)$ is given by the composite of morphisms of smooth sets with the universal de Rham differential from (23):

(24)

d \omega \;\colon\; X \overset{\omega}{\longrightarrow} \mathbf{\Omega}^k \overset{d}{\longrightarrow} \mathbf{\Omega}^{k+1} \,.

Explicitly this means simply that for $\Phi \colon U \to X$ a plot, then

\Phi^\ast (d\omega) \;=\; d\left( \Phi^\ast \omega\right) \;\in\; \Omega^{k+1}(U) \,.

The usual operations on ordinary differential forms directly generalize plot-wise to differential forms on diffeological spaces and more generally on smooth sets:

Definition

(exterior differential and exterior product on smooth sets)

Let $X$ be a diffeological space (def. ) or more generally a smooth set (def. ). Then

For $\omega \in \Omega^n(X)$ a differential form on $X$ (def. ) its exterior differential

$d \omega \in \Omega^{n+1}(X)$

is defined on any plot $\mathbb{R}^n \overset{\Phi}{\to} X$ as the ordinary exterior differential of the pullback of $\omega$ along that plot:

$\Phi^\ast(d \omega) \coloneqq d \Phi^\ast(\omega) \,.$
For $\omega_1 \in \Omega^{n_1}$ and $\omega_2 \in \Omega^{n_2}(X)$ two differential forms on $X$ (def. ) then their exterior product

$\omega_1 \wedge \omega_2 \;\in\; \Omega^{n_1 + n_2}(X)$

is the differential form defined on any plot $\mathbb{R}^n \overset{\Phi}{\to} X$ as the ordinary exterior product of the pullback of th differential forms $\omega_1$ and $\omega_2$ to this plot:

$\Phi^\ast(\omega_1 \wedge \omega_2) \;\coloneqq\; \Phi^\ast(\omega_1) \wedge \Phi^\ast(\omega_2) \,.$

$\,$

Infinitesimal geometry

It is crucial in field theory that we consider field histories not only over all of spacetime, but also restricted to submanifolds of spacetime. Or rather, what is actually of interest are the restrictions of the field histories to the infinitesimal neighbourhoods (example below) of these submanifolds. This appears notably in the construction of phase spaces below. Moreover, fermion fields such as the Dirac field (example below) take values in graded infinitesimal spaces, called super spaces (discussed below). Therefore “infinitesimal geometry”, sometimes called formal geometry (as in “formal scheme”) or synthetic differential geometry or synthetic differential supergeometry, is a central aspect of field theory.

In order to mathematically grasp what infinitesimal neighbourhoods are, we appeal to the first magic algebraic property of differential geometry from prop. , which says that we may recognize smooth manifolds $X$ dually in terms of their commutative algebras $C^\infty(X)$ of smooth functions on them

C^\infty(-) \;\colon\; SmoothManifolds \overset{\phantom{AAA}}{\hookrightarrow} (\mathbb{R} Algebras)^{op} \,.

But since there are of course more algebras $A \in \mathbb{R}Algebras$ than arise this way from smooth manifolds, we may turn this around and try to regard any algebra $A$ as defining a would-be space, which would have $A$ as its algebra of functions.

For example an infinitesimally thickened point should be a space which is “so small” that every smooth function $f$ on it which vanishes at the origin takes values so tiny that some finite power of them is not just even more tiny, but actually vanishes:

Definition

(infinitesimally thickened Cartesian space)

An infinitesimally thickened point

\mathbb{D} \coloneqq Spec(A)

is represented by a commutative algebra $A \in \mathbb{R}Alg$ which as a real vector space is a direct sum

A \simeq_{\mathbb{R}} \langle 1 \rangle \oplus V

of the 1-dimensional space $\langle 1 \rangle = \mathbb{R}$ of multiples of 1 with a finite dimensional vector space $V$ that is a nilpotent ideal in that for each element $a \in V$ there exists a natural number $n \in \mathbb{N}$ such that

a^{n+1} = 0 \,.

More generally, an infinitesimally thickened Cartesian space

\mathbb{R}^n \times \mathbb{D} \;\coloneqq\; \mathbb{R}^n \times Spec(A)

is represented by a commutative algebra

C^\infty(\mathbb{R}^n) \otimes A \;\in\; \mathbb{R} Alg

which is the tensor product of algebras of the algebra of smooth functions $C^\infty(\mathbb{R}^n)$ on an actual Cartesian space of some dimension $n$ (example ), with an algebra of functions $A \simeq_{\mathbb{R}} \langle 1\rangle \oplus V$ of an infinitesimally thickened point, as above.

We say that a smooth function between two infinitesimally thickened Cartesian spaces

\mathbb{R}^{n_1} \times Spec(A_1) \overset{f}{\longrightarrow} \mathbb{R}^{n_2} \times Spec(A_2)

is by definition dually an $\mathbb{R}$ -algebra homomorphism of the form

C^\infty(\mathbb{R}^{n_1}) \otimes A_1 \overset{f^\ast}{\longleftarrow} C^\infty(\mathbb{R}^{n_2}) \otimes A_2 \,.

Example

(infinitesimal neighbourhoods in the real line )

Consider the quotient algebra of the formal power series algebra $\mathbb{R}[ [\epsilon] ]$ in a single parameter $\epsilon$ by the ideal generated by $\epsilon^2$ :

(\mathbb{R}[ [\epsilon] ])/(\epsilon^2) \;\simeq_{\mathbb{R}}\; \mathbb{R} \oplus \epsilon \mathbb{R} \,.

(This is sometimes called the algebra of dual numbers, for no good reason.) The underlying real vector space of this algebra is, as show, the direct sum of the multiples of 1 with the multiples of $\epsilon$ . A general element in this algebra is of the form

a + b \epsilon \in (\mathbb{R}[\epsilon])/(\epsilon^2)

where $a,b \in \mathbb{R}$ are real numbers. The product in this algebra is given by “multiplying out” as usual, and discarding all terms proportional to $\epsilon^2$ :

\left( a_1 + b_1 \epsilon \right) \cdot \left( a_2 + b_2 \epsilon \right) \;=\; a_1 a_2 + ( a_1 b_2 + b_1 a_2 ) \epsilon \,.

We may think of an element $a + b \epsilon$ as the truncation to first order of a Taylor series at the origin of a smooth function on the real line

f \;\colon\; \mathbb{R} \to \mathbb{R}

where $a = f(0)$ is the value of the function at the origin, and where $b = \frac{\partial f}{\partial x}(0)$ is its first derivative at the origin.

Therefore this algebra behaves like the algebra of smooth function on an infinitesimal neighbourhood $\mathbb{D}^1$ of $0 \in \mathbb{R}$ which is so tiny that its elements $\epsilon \in \mathbb{D}^1 \hookrightarrow \mathbb{R}$ become, upon squaring them, not just tinier, but actually zero:

\epsilon^2 = 0 \,.

This intuitive picture is now made precise by the concept of infinitesimally thickened points def. , if we simply set

\mathbb{D}^1 \;\coloneqq\; Spec\left( \mathbb{R}[ [\epsilon] ]/(\epsilon^2) \right)

and observe that there is the inclusion of infinitesimally thickened Cartesian spaces

\mathbb{D}^1 \overset{\phantom{AA}i\phantom{AA} }{\hookrightarrow} \mathbb{R}^1

which is dually given by the algebra homomorphism

\array{ \mathbb{R} \oplus \epsilon \mathbb{R} &\overset{i^\ast}{\longleftarrow}& C^\infty(\mathbb{R}^1) \\ f(0) + \frac{\partial f}{\partial x}(0) &\longleftarrow& \{f\} }

which sends a smooth function to its value $f(0)$ at zero plus $\epsilon$ times its derivative at zero. Observe that this is indeed a homomorphism of algebras due to the product law of differentiation, which says that

\begin{aligned} i^\ast(f \cdot g) & = (f \cdot g)(0) + \frac{\partial f \cdot g}{\partial x}(0) \epsilon \\ & = f(0) \cdot g(0) + \left( \frac{\partial f}{\partial x}(0) \cdot g(0) + f(0) \cdot \frac{\partial g}{\partial x}(0) \right) \epsilon \\ & = \left( f(0) + \frac{\partial f}{\partial x}(0) \epsilon \right) \cdot \left( g(0) + \frac{\partial g}{\partial x}(0) \epsilon \right) \end{aligned}

Hence we see that restricting a smooth function to the infinitesimal neighbourhood of a point is equivalent to restricting attention to its Taylor series to the given order at that point:

\array{ \mathbb{D}^1 &\overset{i}{\hookrightarrow}& \mathbb{R}^1 \\ & {}_{\mathllap{(\epsilon \mapsto f(0) + \frac{\partial f}{\partial x}(0) \epsilon) }}\searrow & \downarrow_{\mathrlap{f}} \\ && \mathbb{R}^1 }

Similarly for each $k \in \mathbb{N}$ the algebra

(\mathbb{R}[ [ \epsilon ] ])/(\epsilon^{k+1})

may be thought of as the algebra of Taylor series at the origin of $\mathbb{R}$ of smooth functions $\mathbb{R} \to \mathbb{R}$ , where all terms of order higher than $k$ are discarded. The corresponding infinitesimally thickened point is often denoted

\mathbb{D}^1(k) \;\coloneqq\; Spec\left( \left(\mathbb{R}[ [\epsilon] ]\right)/(\epsilon^{k+1}) \right) \,.

This is now the subobject of the real line

\mathbb{D}^1(k) \overset{\phantom{AAA}}{\hookrightarrow} \mathbb{R}^1

on those elements $\epsilon$ such that $\epsilon^{k+1} = 0$ .

(Kock 81, Kock 10)

The following example shows that infinitesimal thickening is invisible for ordinary spaces when mapping out of these. In contrast example further below shows that the morphisms into an ordinary space out of an infinitesimal space are interesting: these are tangent vectors and their higher order infinitesimal analogs.

Example

(infinitesimal line $\mathbb{D}^1$ has unique global point)

For $\mathbb{R}^n$ any ordinary Cartesian space (def. ) and $D^1(k) \hookrightarrow \mathbb{R}^1$ the order- $k$ infinitesimal neighbourhood of the origin in the real line from example , there is exactly only one possible morphism of infinitesimally thickened Cartesian spaces from $\mathbb{R}^n$ to $\mathbb{D}^1(k)$ :

\array{ \mathbb{R}^n && \overset{\exists !}{\longrightarrow} &6 \mathbb{D}^1(k) \\ & {}_{\mathllap{\exists !}}\searrow && \nearrow_{\mathrlap{\exists !}} \\ && \mathbb{R}^0 = \ast } \,.

Proof

By definition such a morphism is dually an algebra homomorphism

C^\infty(\mathbb{R}^n) \overset{f^\ast}{\longleftarrow} \left( \mathbb{R}[ [\epsilon] ])/(\epsilon^{k+1} \right) \simeq_{\mathbb{R}} \mathbb{R} \oplus \mathcal{O}(\epsilon)

from the higher order “algebra of dual numbers” to the algebra of smooth functions (example ).

Now this being an $\mathbb{R}$ -algebra homomorphism, its action on the multiples $c \in \mathbb{R}$ of the identity is fixed:

f^\ast(1) = 1 \,.

All the remaining elements are proportional to $\epsilon$ , and hence are nilpotent. However, by the homomorphism property of an algebra homomorphism it follows that it must send nilpotent elements $\epsilon$ to nilpotent elements $f(\epsilon)$ , because

\begin{aligned} \left(f^\ast(\epsilon)\right)^{k+1} & = f^\ast\left( \epsilon^{k+1}\right) \\ & = f^\ast(0) \\ & = 0 \end{aligned}

But the only nilpotent element in $C^\infty(\mathbb{R}^n)$ is the zero element, and hence it follows that

f^\ast(\epsilon) = 0 \,.

Thus $f^\ast$ as above is uniquely fixed.

Example

(synthetic tangent vector fields)

Let $\mathbb{R}^n$ be a Cartesian space (def. ), regarded as an infinitesimally thickened Cartesian space (def. ) and consider $\mathbb{D}^1 \coloneqq Spec( (\mathbb{R}[ [\epsilon] ])/(\epsilon^2) )$ the first order infinitesimal line from example .

Then homomorphisms of infinitesimally thickened Cartesian spaces of the form

\array{ \mathbb{R}^n \times \mathbb{D}^1 && \overset{\tilde v}{\longrightarrow} && \mathbb{R}^n \\ & {}_{\mathllap{pr_1}}\searrow && \swarrow_{\mathrlap{id}} \\ && \mathbb{R}^n }

hence smoothly $X$ -parameterized collections of morphisms

\tilde v_x \;\colon\; \mathbb{D}^1 \longrightarrow \mathbb{R}^n

which send the unique base point $\Re(\mathbb{D}^1) = \ast$ (example ) to $x \in \mathbb{R}^n$ , are in natural bijection with tangent vector fields $v \in \Gamma_{\mathbb{R}^n}(T \mathbb{R}^n)$ (example ).

Proof

By definition, the morphisms in question are dually $\mathbb{R}$ -algebra homomorphisms of the form

(C^\infty(\mathbb{R}^n) \oplus \epsilon C^\infty(\mathbb{R}^n)) \longleftarrow C^\infty(\mathbb{R}^n)

which are the identity modulo $\epsilon$ . Such a morphism has to take any function $f \in C^\infty(\mathbb{R}^n)$ to

f + (\partial f) \epsilon

for some smooth function $(\partial f) \in C^\infty(\mathbb{R}^n)$ . The condition that this assignment makes an algebra homomorphism is equivalent to the statement that for all $f_1,f_2 \in C^\infty(\mathbb{R}^n)$ we have

(f_1 f_2 + (\partial (f_1 f_2))\epsilon ) \;=\; (f_1 + (\partial f_1) \epsilon) \cdot (f_2 + (\partial f_2) \epsilon) \,.

Multiplying this out and using that $\epsilon^2 = 0$ , this is equivalent to

\partial(f_1 f_2) = (\partial f_1) f_2 + f_1 (\partial f_2) \,.

This in turn means equivalently that $\partial\colon C^\infty(\mathbb{R}^n)\to C^\infty(\mathbb{R}^n)$ is a derivation.

With this the statement follows with the third magic algebraic property of smooth functions (prop. ): derivations of smooth functions are vector fields.

We need to consider infinitesimally thickened spaces more general than the thickenings of just Cartesian spaces in def. . But just as Cartesian spaces (def. ) serve as the local test geometries to induce the general concept of diffeological spaces and smooth sets (def. ), so using infinitesimally thickened Cartesian spaces as test geometries immediately induces the corresponding generalization of smooth sets with infinitesimals:

Definition

(formal smooth set)

A formal smooth set $X$ is

for each infinitesimally thickened Cartesian space $\mathbb{R}^n \times Spec(A)$ (def. ) a set

$X(\mathbb{R}^n \times Spec(A)) \in Set$

to be called the set of smooth functions or plots from $\mathbb{R}^n \times Spec(A)$ to $X$ ;
for each smooth function $f \;\colon\; \mathbb{R}^{n_1} \times Spec(A_1) \longrightarrow \mathbb{R}^{n_2} \times Spec(A_2)$ between infinitesimally thickened Cartesian spaces a choice of function

$f^\ast \;\colon\; X(\mathbb{R}^{n_2} \times Spec(A_2)) \longrightarrow X(\mathbb{R}^{n_1} \times Spec(A_1))$

to be thought of as the precomposition operation

$\left( \mathbb{R}^{n_2} \overset{\Phi}{\longrightarrow} X \right) \;\overset{f^\ast}{\mapsto}\; \left( \mathbb{R}^{n_1}\times Spec(A_1) \overset{f}{\to} \mathbb{R}^{n_2} \times Spec(A_2) \overset{\Phi}{\to} X \right)$

such that

(functoriality)
1. If $id_{\mathbb{R}^n \times Spec(A)} \;\colon\; \mathbb{R}^n \times Spec(A) \to \mathbb{R}^n \times Spec(A)$ is the identity function on $\mathbb{R}^n \times Spec(A)$ , then $\left(id_{\mathbb{R}^n \times Spec(A)}\right)^\ast \;\colon\; X(\mathbb{R}^n \times Spec(A)) \to X(\mathbb{R}^n \times Spec(A))$ is the identity function on the set of plots $X(\mathbb{R}^n \times Spec(A))$ ;
2. If $\mathbb{R}^{n_1}\times Spec(A_1) \overset{f}{\to} \mathbb{R}^{n_2} \times Spec(A_2) \overset{g}{\to} \mathbb{R}^{n_3} \times Spec(A_3)$ are two composable smooth functions between infinitesimally thickened Cartesian spaces, then pullback of plots along them consecutively equals the pullback along the composition:
  
  $f^\ast \circ g^\ast = (g \circ f)^\ast$
  
  i.e.
  
  $\array{ && X(\mathbb{R}^{n_2} \times Spec(A_2)) \\ & {}^{\mathllap{f^\ast}}\swarrow && \nwarrow^{\mathrlap{g^\ast}} \\ X(\mathbb{R}^{n_1} \times Spec(A_1)) && \underset{ (g \circ f)^\ast }{\longleftarrow} && X(\mathbb{R}^{n_3} \times Spec(A_3)) }$
(gluing)

If $\{ U_i \times Spec(A) \overset{f_i \times id_{Spec(A)}}{\to} \mathbb{R}^n \times Spec(A)\}_{i \in I}$ is such that

$\{ U_i \overset{f_i }{\to} \mathbb{R}^n \}_{i \in I}$

in a differentiably good open cover (def. ) then the function which restricts $\mathbb{R}^n \times Spec(A)$ -plots of $X$ to a set of $U_i \times Spec(A)$ -plots

$X(\mathbb{R}^n \times Spec(A)) \overset{( (f_i)^\ast )_{i \in I} }{\hookrightarrow} \underset{i \in I}{\prod} X(U_i \times Spec(A))$

is a bijection onto the set of those tuples $(\Phi_i \in X(U_i))_{i \in I}$ of plots, which are “matching families” in that they agree on intersections:

$\phi_i\vert_{((U_i \cap U_j) \times Spec(A)} = \phi_j \vert_{(U_i \cap U_j)\times Spec(A)}$

i.e.

$\array{ && (U_i \cap U_j) \times Spec(A) \\ & \swarrow && \searrow \\ U_i\times Spec(A) && && U_j \times Spec(A) \\ & {}_{\mathrlap{\Phi_i}}\searrow && \swarrow_{\mathrlap{\Phi_j}} \\ && X }$

Finally, given $X_1$ and $X_2$ two formal smooth sets, then a smooth function between them

f \;\colon\; X_1 \longrightarrow X_2

for each infinitesimally thickened Cartesian space $\mathbb{R}^n \times Spec(A)$ (def. ) a function

$f_\ast(\mathbb{R}^n \times Spec(A)) \;\colon\; X_1(\mathbb{R}^n \times Spec(A)) \longrightarrow X_2(\mathbb{R}^n \times Spec(A))$

such that

for each smooth function $g \colon \mathbb{R}^{n_1} \times Spec(A_1) \to \mathbb{R}^{n_2} \times Spec(A_2)$ between infinitesimally thickened Cartesian spaces we have

$g^\ast_2 \circ f_\ast(\mathbb{R}^{n_2} \times Spec(A_2)) = f_\ast(\mathbb{R}^{n_1} \times Spec(A_1)) \circ g^\ast_1$

i.e.

$\array{ X_1(\mathbb{R}^{n_2} \times Spec(A_2)) &\overset{f_\ast(\mathbb{R}^{n_2}\times Spec(A_2) )}{\longrightarrow}& X_2(\mathbb{R}^{n_2} \times Spec(A_2)) \\ \mathllap{g_1^\ast}\downarrow && \downarrow\mathrlap{g^\ast_2} \\ X_1(\mathbb{R}^{n_1} \times Spec(A_1)) &\underset{f_\ast(\mathbb{R}^{n_1})}{\longrightarrow}& X_2(\mathbb{R}^{n_1} \times Spec(A_1)) }$

(Dubuc 79)

Basing infinitesimal geometry on formal smooth sets is an instance of the general approach to geometry called functorial geometry or topos theory. For more background on this see at geometry of physics – manifolds and orbifolds.

We have the evident generalization of example to smooth geometry with infinitesimals:

Example

(infinitesimally thickened Cartesian spaces are formal smooth sets)

For $X$ an infinitesimally thickened Cartesian space (def. ), it becomes a formal smooth set according to def. by taking its plots out of some $\mathbb{R}^n \times \mathbb{D}$ to be the homomorphism of infinitesimally thickened Cartesian spaces:

X(\mathbb{R}^n \times \mathbb{D}) \;\coloneqq\; Hom_{FormalCartSp}( \mathbb{R}^n \times \mathbb{D}, X ) \,.

(Stated more abstractly, this is an instance of the Yoneda embedding over a subcanonical site.)

Example

(smooth sets are formal smooth sets)

Let $X$ be a smooth set (def. ). Then $X$ becomes a formal smooth set (def. ) by declaring the set of plots $X(\mathbb{R}^n \times \mathbb{D})$ over an infinitesimally thickened Cartesian space (def. ) to be equivalence classes of pairs

\mathbb{R}^n \times \mathbb{D} \longrightarrow \mathbb{R}^{k} \,, \phantom{AA} \mathbb{R}^k \longrightarrow X

of a morphism of infinitesimally thickened Cartesian spaces and of a plot of $X$ , as shown, subject to the equivalence relation which identifies two such pairs if there exists a smooth function $f \colon \mathbb{R}^k \to \mathbb{R}^{k'}$ such that

\array{ && \mathbb{R}^n \times \mathbb{D} \\ & \swarrow && \searrow \\ \mathbb{R}^k && \overset{f}{\longrightarrow} && \mathbb{R}^{k'} \\ \mathbb{R}^k && \underset{f}{\longrightarrow} && \mathbb{R}^{k'} \\ & \searrow && \swarrow \\ && X }

Stated more abstractly this says that $X$ as a formal smooth set is the left Kan extension (see this example) of $X$ as a smooth set along the functor that includes Cartesian spaces (def. ) into infinitesimally thickened Cartesian spaces (def. ).

Definition

(reduction and infinitesimal shape)

For $\mathbb{R}^n \times \mathbb{D}$ an infinitesimally thickened Cartesian space (def. ) we say that the underlying ordinary Cartesian space $\mathbb{R}^n$ (def. ) is its reduction

\Re\left( \mathbb{R}^n \times \mathbb{D} \right) \;\coloneqq\; \mathbb{R}^n \,.

There is the canonical inclusion morphism

\Re\left( \mathbb{R}^n \times \mathbb{D} \right) = \mathbb{R}^n \overset{\phantom{AAAA}}{\hookrightarrow} \mathbb{R}^n \times \mathbb{D}

which dually corresponds to the homomorphism of commutative algebras

C^\infty(\mathbb{R}^n) \longleftarrow C^\infty(\mathbb{R}^n) \otimes_{\mathbb{R}} A

which is the identity on all smooth functions $f \in C^\infty(\mathbb{R}^n)$ and is zero on all elements $a \in V \subset A$ in the nilpotent ideal of $A$ (as in example ).

Given any formal smooth set $X$ , we say that its infinitesimal shape or de Rham shape (also: de Rham stack) is the formal smooth set $\Im X$ (def. ) defined to have as plots the reductions of the plots of $X$ , according to the above:

(\Im X)( U ) \;\coloneqq\: X(\Re(U)) \,.

There is a canonical morphism of formal smooth set

\eta_X \;\colon\; X \longrightarrow \Im X

which takes a plot

U = \mathbb{R}^n \times \mathbb{D} \overset{f}{\longrightarrow} X

to the composition

\mathbb{R}^n \hookrightarrow \mathbb{R}^n \times \mathbb{D} \overset{f}{\hookrightarrow} X

regarded as a plot of $\Im X$ .

Example

(mapping space out of an infinitesimally thickened Cartesian space)

Let $X$ be an infinitesimally thickened Cartesian space (def. ) and let $Y$ be a formal smooth set (def. ). Then the mapping space

[X,Y] \;\in\; FormalSmoothSet

of smooth functions from $X$ to $Y$ is the formal smooth set whose $U$ -plots are the morphisms of formal smooth sets from the Cartesian product of infinitesimally thickened Cartesian spaces $U \times X$ to $Y$ , hence the $U \times X$ -plots of $Y$ :

[X,Y](U) \;\coloneqq\; Y(U \times X) \,.

Example

(synthetic tangent bundle)

Let $X \coloneqq \mathbb{R}^n$ be a Cartesian space (def. ) regarded as an infinitesimally thickened Cartesian space () and thus regarded as a formal smooth set (def. ) by example . Consider the infinitesimal line

\mathbb{D}^1 \hookrightarrow \mathbb{R}^1

from example . Then the mapping space $[\mathbb{D}^1, X]$ (example ) is the total space of the tangent bundle $T X$ (example ). Moreover, under restriction along the reduction $\ast \longrightarrow \mathbb{D}^1$ , this is the full tangent bundle projection, in that there is a natural isomorphism of formal smooth sets of the form

\array{ T X &\simeq& [\mathbb{D}^1, X] \\ {}^{\mathllap{tb}}\downarrow && \downarrow^{\mathrlap{ [ \ast \to \mathbb{D}^1, X ] }} \\ X &\simeq& [\ast, X] }

In particular this implies immediately that smooth sections (def. ) of the tangent bundle

\array{ && [\mathbb{D}^1, X] & \simeq T X \\ & {}^{\mathllap{v}}\nearrow & \downarrow \\ X &=& X }

are equivalently morphisms of the form

\array{ && X \\ & {}^{\mathllap{\tilde v}}\nearrow & \downarrow^{\mathrlap{id}} \\ X \times \mathbb{D}^1 &\underset{pr_1}{\longrightarrow}& X }

which we had already identified with tangent vector fields (def. ) in example .

Proof

This follows by an analogous argument as in example , using the Hadamard lemma.

While in infinitesimally thickened Cartesian spaces (def. ) only infinitesimals to any finite order may exist, in formal smooth sets (def. ) we may find infinitesimals to any arbitrary finite order:

Example

(infinitesimal neighbourhood)

Let $X$ be a formal smooth sets (def. ) $Y \hookrightarrow X$ a sub-formal smooth set. Then the infinitesimal neighbourhood to arbitrary infinitesimal order of $Y$ in $X$ is the formal smooth set $N_X Y$ whose plots are those plots of $X$

\mathbb{R}^n \times Spec(A) \overset{f}{\longrightarrow} X

such that their reduction (def. )

\mathbb{R}^n \hookrightarrow \mathbb{R}^n \times Spec(A) \overset{f}{\longrightarrow} X

factors through a plot of $Y$ .

This allows to grasp the restriction of field histories to the infinitesimal neighbourhood of a submanifold of spacetime, which will be crucial for the discussion of phase spaces below.

Definition

(field histories on infinitesimal neighbourhood of submanifold of spacetime)

Let $E \overset{fb}{\to} \Sigma$ be a field bundle (def. ) and let $S \hookrightarrow \Sigma$ be a submanifold of spacetime.

We write $N_\Sigma(S) \hookrightarrow \Sigma$ for its infinitesimal neighbourhood in $\Sigma$ (def. ).

Then the set of field histories restricted to $S$ , to be denoted

(25)

\Gamma_{S}(E) \coloneqq \Gamma_{N_\Sigma(S)}( E\vert_{N_\Sigma S} ) \in \mathbf{H}

is the set of section of $E$ restricted to the infinitesimal neighbourhood $N_\Sigma(S)$ (example ).

$\,$

We close the discussion of infinitesimal differential geometry by explaining how we may recover the concept of smooth manifolds inside the more general formal smooth sets (def./prop. below). The key point is that the presence of infinitesimals in the theory allows an intrinsic definition of local diffeomorphisms/formally étale morphism (def. and example below). It is noteworthy that the only role this concept plays in the development of field theory below is that smooth manifolds admit triangulations by smooth singular simplices (def. ) so that the concept of fiber integration of differential forms is well defined over manifolds.

Definition

(local diffeomorphism of formal smooth sets)

Let $X,Y$ be formal smooth sets (def. ). Then a morphism between them is called a local diffeomorphism or formally étale morphism, denoted

f \;\colon\; X \overset{et}{\longrightarrow} Y \,,

if $f$ if for each infinitesimally thickened Cartesian space (def. ) $\mathbb{R}^n \times \mathbb{D}$ we have a natural identification between the $\mathbb{R}^n \times \mathbb{D}$ -plots of $X$ with those $\mathbb{R}^n n\times \mathbb{D}$ -plots of $Y$ whose reduction (def. ) comes from an $\mathbb{R}^n$ -plot of $X$ , hence if we have a natural fiber product of sets of plots

X(\mathbb{R}^n \times \mathbb{D}) \;\simeq\; Y(\mathbb{R}^n \times \mathbb{D}) \underset{Y(\mathbb{R}^n)}{\times^f} X(\mathbb{R}^n)

i. e.

\array{ && X(\mathbb{R}^n \times \mathbb{D}) \\ & \swarrow && \searrow \\ Y(\mathbb{R}^n \times \mathbb{D}) && \text{(pb)} && X(\mathbb{R}^n) \\ & \searrow && \swarrow \\ && Y(\mathbb{R}^n ) }

for all infinitesimally thickened Cartesian spaces $\mathbb{R}^n \times \mathbb{D}$ .

Stated more abstractly, this means that the naturality square of the unit of the infinitesimal shape $\Im$ (def. ) is a pullback square

\array{ X &\overset{\eta_X}{\longrightarrow}& \Im X \\ {}^{\mathllap{f}}\downarrow &\text{(pb)}& \downarrow^{\mathrlap{\Im f}} \\ Y &\underset{\eta_Y}{\longrightarrow}& \Im Y }

(Khavkine-Schreiber 17, def. 3.1)

Example

(local diffeomorphism between Cartesian spaces from the general definition)

For $X,Y \in CartSp$ two ordinary Cartesian spaces (def. ), regarded as formal smooth sets by example then a morphism $f \colon X \to Y$ between them is a local diffeomorphism in the general sense of def. precisely if it is so in the ordinary sense of def. .

(Khavkine-Schreiber 17, prop. 3.2)

Definition/Proposition

(smooth manifolds)

A smooth manifold $X$ of dimension $n \in \mathbb{N}$ is

a diffeological space (def. )

such that

there exists an indexed set $\{ \mathbb{R}^n \overset{\phi_i}{\to} X\}_{i \in I}$ of morphisms of formal smooth sets (def. ) from Cartesian spaces $\mathbb{R}^n$ (def. ) (regarded as formal smooth sets via example , example and example ) into $X$ , (regarded as a formal smooth set via example and example ) such that
1. every point $x \in X_s$ is in the image of at least one of the $\phi_i$ ;
2. every $\phi_i$ is a local diffeomorphism according to def. ;
the final topology induced by the set of plots of $X$ makes $X_s$ a paracompact Hausdorff space.

(Khavkine-Schreiber 17, example 3.4)

For more on smooth manifolds from the perspective of formal smooth sets see at geometry of physics – manifolds and orbifolds.

$\,$

fermion fields and supergeometry

Field theories of interest crucially involve fermionic fields (def. below), such as the Dirac field (example below), which are subject to the “Pauli exclusion principle”, a key reason for the stability of matter. Mathematically this principle means that these fields have field bundles whose field fiber is not an ordinary manifold, but an odd-graded supermanifold (more on this in remark and remark below).

This “supergeometry” is an immediate generalization of the infinitesimal geometry above, where now the infinitesimal elements in the algebra of functions may be equipped with a grading: one speaks of superalgebra.

The “super”-terminology for something as down-to-earth as the mathematical principle behind the stability of matter may seem unfortunate. For better or worse, this terminology has become standard since the middle of the 20th century. But the concept that today is called supercommutative superalgebra was in fact first considered by Grassmann 1844 who got it right (“Ausdehnungslehre”) but apparently was too far ahead of his time and remained unappreciated.

Beware that considering supergeometry does not necessarily involve considering “supersymmetry”. Supergeometry is the geometry of fermion fields (def. below), and that all matter fields in the observable universe are fermionic has been experimentally established since the Stern-Gerlach experiment in 1922. Supersymmetry, on the other hand, is a hypothetical extension of spacetime-symmetry within the context of supergeometry. Here we do not discuss supersymmetry; for details see instead at geometry of physics – supersymmetry.

Definition

(supercommutative superalgebra)

A real $\mathbb{Z}/2$ -graded algebra or superalgebra is an associative algebra $A$ over the real numbers together with a direct sum decomposition of its underlying real vector space

A \simeq_{\mathbb{R}} A_{even} \oplus A_{odd} \,,

such that the product in the algebra respects the multiplication in the cyclic group of order 2 $\mathbb{Z}/2 = \{even, odd\}$ :

\left. \array{ A_{even} \cdot A_{even} \\ A_{odd} \cdot A_{odd} } \right\} \subset A_{even} \phantom{AAAA} \left. \array{ A_{odd} \cdot A_{even} \\ A_{even} \cdot A_{odd} } \right\} \subset A_{odd} \,.

This is called a supercommutative superalgebra if for all elements $a_1, a_2 \in A$ which are of homogeneous degree ${\vert a_i \vert} \in \mathbb{Z}/2 = \{even, odd\}$ in that

a_i \in A_{{\vert a_i\vert}} \subset A

we have

a_1 \cdot a_2 = (-1)^{{\vert a_1 \vert \vert a_2 \vert}} a_2 \cdot a_1 \,.

A homomorphism of superalgebras

f \;\colon\; A \longrightarrow A'

is a homomorphism of associative algebras over the real numbers such that the $\mathbb{Z}/2$ -grading is respected in that

f(A_{even}) \subset A'_{even} \phantom{AAAAA} f(A_{odd}) \subset A'_{odd} \,.

For more details on superalgebra see at geometry of physics – superalgebra.

Example

(basic examples of supercommutative superalgebras)

Basic examples of supercommutative superalgebras (def. ) include the following:

Every commutative algebra $A$ becomes a supercommutative superalgebra by declaring it to be all in even degree: $A = A_{even}$ .
For $V$ a finite dimensional real vector space, then the Grassmann algebra $A \coloneqq \wedge^\bullet_{\mathbb{R}} V^\ast$ is a supercommutative superalgebra with $A_{even} \coloneqq \wedge^{even} V^\ast$ and $A_{odd} \coloneqq \wedge^{odd} V^\ast$ .

More explicitly, if $V = \mathbb{R}^s$ is a Cartesian space with canonical dual coordinates $(\theta^i)_{i = 1}^s$ then the Grassmann algebra $\wedge^\bullet (\mathbb{R}^s)^\ast$ is the real algebra which is generated from the $\theta^i$ regarded in odd degree and hence subject to the relation

$\theta^i \cdot \theta^j = - \theta^j \cdot \theta^i \,.$

In particular this implies that all the $\theta^i$ are infinitesimal (def. ):

$\theta^i \cdot \theta^i = 0 \,.$
For $A_1$ and $A_2$ two supercommutative superalgebras, there is their tensor product supercommutative superalgebra $A_1 \otimes_{\mathbb{R}} A_2$ . For example for $X$ a smooth manifold with ordinary algebra of smooth functions $C^\infty(X)$ regarded as a supercommutative superalgebra by the first example above, the tensor product with a Grassmann algebra (second example above) is the supercommutative superalgebta

$C^\infty(X) \otimes_{\mathbb{R}} \wedge^\bullet((\mathbb{R}^s)\ast)$

whose elements may uniquely be expanded in the form

$f + f_i \theta^i + f_{i j} \theta^i \theta^j + f_{i j k} \theta^i \theta^j \theta^k + \cdots + f_{i_1 \cdots i_s} \theta^{i_1} \cdots \theta^{i_s} \,,$

where the $f_{i_1 \cdots i_k} \in C^\infty(X)$ are smooth functions on $X$ which are skew-symmetric in their indices.

The following is the direct super-algebraic analog of the definition of infinitesimally thickened Cartesian spaces (def. ):

Definition

(super Cartesian space)

A superpoint $Spec(A)$ is represented by a super-commutative superalgebra $A$ (def. ) which as a $\mathbb{Z}/2$ -graded vector space (super vector space) is a direct sum

A \simeq_{\mathbb{R}} \langle 1 \rangle \oplus V

of the 1-dimensional even vector space $\langle 1 \rangle = \mathbb{R}$ of multiples of 1, with a finite dimensional super vector space $V$ that is a nilpotent ideal in $A$ in that for each element $a \in V$ there exists a natural number $n \in \mathbb{N}$ such that

a^{n+1} = 0 \,.

More generally, a super Cartesian space $\mathbb{R}^n \times Spec(A)$ is represented by a super-commutative algebra $C^\infty(\mathbb{R}^n) \otimes A \in \mathbb{R} Alg$ which is the tensor product of algebras of the algebra of smooth functions $C^\infty(\mathbb{R}^n)$ on an actual Cartesian space of some dimension $n$ , with an algebra of functions $A \simeq_{\mathbb{R}} \langle 1\rangle \oplus V$ of a superpoint (example ).

Specifically, for $s \in \mathbb{N}$ , there is the superpoint

(26)

\mathbb{R}^{0 \vert s} \;\coloneqq\; Spec\left( \wedge^\bullet (\mathbb{R}^s)^\ast \right)

whose algebra of functions is by definition the Grassmann algebra on $s$ generators $(\theta^i)_{i = 1}^s$ in odd degree (example ).

We write

\begin{aligned} \mathbb{R}^{b\vert s} & \coloneqq \mathbb{R}^b \times \mathbb{R}^{0 \vert s} \\ & = \mathbb{R}^b \times Spec( \wedge^\bullet(\mathbb{R}^s)^\ast ) \\ & = Spec\left( C^\infty(\mathbb{R}^b) \otimes_{\mathbb{R}} \wedge^\bullet (\mathbb{R}^s)^\ast \right) \end{aligned}

for the corresponding super Cartesian spaces whose algebra of functions is as in example .

We say that a smooth function between two super Cartesian spaces

\mathbb{R}^{n_1} \times Spec(A_1) \overset{f}{\longrightarrow} \mathbb{R}^{n_2} \times Spec(A_2)

is by definition dually a homomorphism of supercommutative superalgebras (def. ) of the form

C^\infty(\mathbb{R}^{n_1}) \otimes A_1 \overset{f^\ast}{\longleftarrow} C^\infty(\mathbb{R}^{n_2}) \otimes A_2 \,.

Example

(superpoint induced by a finite dimensional vector space)

Let $V$ be a finite dimensional real vector space. With $V^\ast$ denoting its dual vector space write $\wedge^\bullet V^\ast$ for the Grassmann algebra that it generates. This being a supercommutative algebra, it defines a superpoint (def. ).

We denote this superpoint by

V_{odd} \simeq \mathbb{R}^{0 \vert dim(V)} \,.

All the differential geometry over Cartesian space that we discussed above generalizes immediately to super Cartesian spaces (def. ) if we stricly adhere to the super sign rule which says that whenever two odd-graded elements swap places, a minus sign is picked up. In particular we have the following generalization of the de Rham complex on Cartesian spaces discussed above.

Definition

(super differential forms on super Cartesian spaces)

For $\mathbb{R}^{b\vert s}$ a super Cartesian space (def. ), hence the formal dual of the supercommutative superalgebra of the form

C^\infty(\mathbb{R}^{b\vert s}) \;=\; C^\infty(\mathbb{R}^b) \otimes_{\mathbb{R}} \wedge^\bullet \mathbb{R}^s

with canonical even-graded coordinate functions $(x^i)_{i = 1^b}$ and odd-graded coordinate functions $(\theta^j)_{j = 1}^s$ .

Then the de Rham complex of super differential forms on $\mathbb{R}^{b\vert s}$ is, in super-generalization of def. , the $\mathbb{Z} \times (\mathbb{Z}/2)$ -graded commutative algebra

\Omega^\bullet(\mathbb{R}^{b|s}) \;\coloneqq\; C^\infty(\mathbb{R}^{b|s}) \otimes_{\mathbb{R}} \wedge^\bullet \langle d x^1, \cdots, d x^b, \; d \theta^1, \cdots, d\theta^s \rangle

which is generated over $C^\infty(\mathbb{R}^{b\vert s})$ from new generators

\underset{ \text{deg} = (1,even) }{\underbrace{ d x^i }} \phantom{AAAAA} \underset{ \text{deg} = (1,odd) }{ \underbrace{ d \theta^j } }

whose differential is defined in degree-0 by

d f \;\coloneqq\; \frac{\partial f}{\partial x^i} d x^i + \frac{\partial f}{\partial \theta^j} d \theta^j

and extended from there as a bigraded derivation of bi-degree $(1,even)$ , in super-generalization of def. .

Accordingly, the operation of contraction with tangent vector fields (def. ) has bi-degree $(-1,\sigma)$ if the tangent vector has super-degree $\sigma$ :

generator	bi-degree
$\phantom{AA} x^a$	(0,even)
$\phantom{AA} \theta^\alpha$	(0,odd)
$\phantom{AA} dx^a$	(1,even)
$\phantom{AA} d\theta^\alpha$	(1,odd)

derivation	bi-degree
$\phantom{AA} d$	(1,even)
$\phantom{AA}\iota_{\partial x^a}$	(-1, even)
$\phantom{AA}\iota_{\partial \theta^\alpha}$	(-1,odd)

This means that if $\alpha \in \Omega^\bullet(\mathbb{R}^{b\vert s})$ is in bidegree $(n_\alpha, \sigma_\alpha)$ , and $\beta \in \Omega^\bullet(\mathbb{R}^{b \vert \sigma})$ is in bidegree $(n_\beta, \sigma_\beta)$ , then

\alpha \wedge \beta \; = \; (- 1)^{n_\alpha n_\beta + \sigma_\alpha \sigma_\beta} \; \beta \wedge \alpha \,.

Hence there are two contributions to the sign picked up when exchanging two super-differential forms in the wedge product:

there is a “cohomological sign” which for commuting an $n_1$ -forms past an $n_2$ -form is $(-1)^{n_1 n_2}$ ;
in addition there is a “super-grading” sign which for commuting a $\sigma_1$ -graded coordinate function past a $\sigma_2$ -graded coordinate function (possibly under the de Rham differential) is $(-1)^{\sigma_1 \sigma_2}$ .

For example:

x^{a_1} (dx^{a_2}) \;=\; + (dx^{a_2}) x^{a_1}

\theta^\alpha (dx^a) \;=\; + (dx^a) \theta^\alpha

\theta^{\alpha_1} (d\theta^{\alpha_2}) \;=\; - (d\theta^{\alpha_2}) \theta^{\alpha_1}

dx^{a_1} \wedge d x^{a_2} \;=\; - d x^{a_2} \wedge d x^{a_1}

dx^a \wedge d \theta^{\alpha} \;=\; - d\theta^{\alpha} \wedge d x^a

d\theta^{\alpha_1} \wedge d \theta^{\alpha_2} \;=\; + d\theta^{\alpha_2} \wedge d \theta^{\alpha_1}

(e.g. Castellani-D’Auria-Fré 91 (II.2.106) and (II.2.109), Deligne-Freed 99, section 6)

Beware that there is also another sign rule for super differential forms used in the literature. See at signs in supergeometry for further discussion.

$\,$

It is clear now by direct analogy with the definition of formal smooth sets (def. ) what the corresponding supergeometric generalization is. For definiteness we spell it out yet once more:

Definition

(super smooth set)

A super smooth set $X$ is

for each super Cartesian space $\mathbb{R}^n \times Spec(A)$ (def. ) a set

$X(\mathbb{R}^n \times Spec(A)) \in Set$

to be called the set of smooth functions or plots from $\mathbb{R}^n \times Spec(A)$ to $X$ ;
for each smooth function $f \;\colon\; \mathbb{R}^{n_1} \times Spec(A_1) \longrightarrow \mathbb{R}^{n_2} \times Spec(A_2)$ between super Cartesian spaces a choice of function

$f^\ast \;\colon\; X(\mathbb{R}^{n_2} \times Spec(A_2)) \longrightarrow X(\mathbb{R}^{n_1} \times Spec(A_1))$

to be thought of as the precomposition operation

$\left( \mathbb{R}^{n_2} \overset{\Phi}{\longrightarrow} X \right) \;\overset{f^\ast}{\mapsto}\; \left( \mathbb{R}^{n_1}\times Spec(A_1) \overset{f}{\to} \mathbb{R}^{n_2} \times Spec(A_2) \overset{\Phi}{\to} X \right)$

such that

(functoriality)
1. If $id_{\mathbb{R}^n \times Spec(A)} \;\colon\; \mathbb{R}^n \times Spec(A) \to \mathbb{R}^n \times Spec(A)$ is the identity function on $\mathbb{R}^n \times Spec(A)$ , then $\left(id_{\mathbb{R}^n \times Spec(A)}\right)^\ast \;\colon\; X(\mathbb{R}^n \times Spec(A)) \to X(\mathbb{R}^n \times Spec(A))$ is the identity function on the set of plots $X(\mathbb{R}^n \times Spec(A))$ .
2. If $\mathbb{R}^{n_1}\times Spec(A_1) \overset{f}{\to} \mathbb{R}^{n_2} \times Spec(A_2) \overset{g}{\to} \mathbb{R}^{n_3} \times Spec(A_3)$ are two composable smooth functions between infinitesimally thickened Cartesian spaces, then pullback of plots along them consecutively equals the pullback along the composition:
  
  $f^\ast \circ g^\ast = (g \circ f)^\ast$
  
  i.e.
  
  $\array{ && X(\mathbb{R}^{n_2} \times Spec(A_2)) \\ & {}^{\mathllap{f^\ast}}\swarrow && \nwarrow^{\mathrlap{g^\ast}} \\ X(\mathbb{R}^{n_1} \times Spec(A_1)) && \underset{ (g \circ f)^\ast }{\longleftarrow} && X(\mathbb{R}^{n_3} \times Spec(A_3)) }$
(gluing)

If $\{ U_i \times Spec(A) \overset{f_i \times id_{Spec(A)}}{\to} \mathbb{R}^n \times Spec(A)\}_{i \in I}$ is such that

$\{ U_i \overset{f_i }{\to} \mathbb{R}^n \}_{i \in I}$

is a differentiably good open cover (def. ) then the function which restricts $\mathbb{R}^n \times Spec(A)$ -plots of $X$ to a set of $U_i \times Spec(A)$ -plots

$X(\mathbb{R}^n \times Spec(A)) \overset{( (f_i)^\ast )_{i \in I} }{\hookrightarrow} \underset{i \in I}{\prod} X(U_i \times Spec(A))$

is a bijection onto the set of those tuples $(\Phi_i \in X(U_i))_{i \in I}$ of plots, which are “matching families” in that they agree on intersections:

$\phi_i\vert_{((U_i \cap U_j) \times Spec(A)} = \phi_j \vert_{(U_i \cap U_j)\times Spec(A)}$

i.e.

$\array{ && (U_i \cap U_j) \times Spec(A) \\ & \swarrow && \searrow \\ U_i\times Spec(A) && && U_j \times Spec(A) \\ & {}_{\mathrlap{\Phi_i}}\searrow && \swarrow_{\mathrlap{\Phi_j}} \\ && X }$

Finally, given $X_1$ and $X_2$ two super formal smooth sets, then a smooth function between them

f \;\colon\; X_1 \longrightarrow X_2

for each super Cartesian space $\mathbb{R}^n \times Spec(A)$ a function

$f_\ast(\mathbb{R}^n \times Spec(A)) \;\colon\; X_1(\mathbb{R}^n \times Spec(A)) \longrightarrow X_2(\mathbb{R}^n \times Spec(A))$

such that

for each smooth function $g \colon \mathbb{R}^{n_1} \times Spec(A_1) \to \mathbb{R}^{n_2} \times Spec(A_2)$ between super Cartesian spaces we have

$g^\ast_2 \circ f_\ast(\mathbb{R}^{n_2} \times Spec(A_2)) = f_\ast(\mathbb{R}^{n_1} \times Spec(A_1)) \circ g^\ast_1$

i.e.

$\array{ X_1(\mathbb{R}^{n_2} \times Spec(A_2)) &\overset{f_\ast(\mathbb{R}^{n_2}\times Spec(A_2) )}{\longrightarrow}& X_2(\mathbb{R}^{n_2} \times Spec(A_2)) \\ \mathllap{g_1^\ast}\downarrow && \downarrow\mathrlap{g^\ast_2} \\ X_1(\mathbb{R}^{n_1} \times Spec(A_1)) &\underset{f_\ast(\mathbb{R}^{n_1})}{\longrightarrow}& X_2(\mathbb{R}^{n_1} \times Spec(A_1)) }$

(Yetter 88)

Basing supergeometry on super formal smooth sets is an instance of the general approach to geometry called functorial geometry or topos theory. For more background on this see at geometry of physics – supergeometry.

In direct generalization of example we have:

Example

(super Cartesian spaces are super smooth sets)

Let $X$ be a super Cartesian space (def. ) Then it becomes a super smooth set (def. ) by declaring its plots $\Phi \in X(\mathbb{R}^n \times \mathbb{D})$ to the algebra homomorphisms $C^\infty(\mathbb{R}^n \times \mathbb{D}) \leftarrow C^\infty(\mathbb{R}^{b\vert s})$ .

Under this identification, morphisms between super Cartesian spaces are in natural bijection with their morphisms regarded as super smooth sets.

Stated more abstractly, this statement is an example of the Yoneda embedding over a subcanonical site.

Similarly, in direct generalization of prop. we have:

Proposition

(plots of a super smooth set really are the smooth functions into the smooth smooth set)

Let $X$ be a super smooth set (def. ). For $\mathbb{R}^n \times \mathbb{D}$ any super Cartesian space (def. ) there is a natural function

Hom_{SmoothSet}(\mathbb{R}^n , X) \overset{\simeq}{\longrightarrow} X(\mathbb{R}^n)

from the set of homomorphisms of super smooth sets from $\mathbb{R}^n \times \mathbb{D}$ (regarded as a super smooth set via example ) to $X$ , to the set of plots of $X$ over $\mathbb{R}^n \times \mathbb{D}$ , given by evaluating on the identity plot $id_{\mathbb{R}^n \times \mathbb{D}}$ .

This function is a bijection.

This says that the plots of $X$ , which initially bootstrap $X$ into being as declaring the would-be smooth functions into $X$ , end up being the actual smooth functions into $X$ .

Proof

This is the statement of the Yoneda lemma over the site of super Cartesian spaces.

We do not need to consider here supermanifolds more general than the super Cartesian spaces (def. ). But for those readers familiar with the concept we include the following direct analog of the characterization of smooth manifolds according to def./prop. :

Definition/Proposition

(supermanifolds)

A supermanifold $X$ of dimension super-dimension $(b,s) \in \mathbb{N} \times \mathbb{N}$ is

a super smooth set (def. )

such that

there exists an indexed set $\{ \mathbb{R}^{b\vert s} \overset{\phi_i}{\to} X\}_{i \in I}$ of morphisms of super smooth sets (def. ) from super Cartesian spaces $\mathbb{R}^{b\vert s}$ (def. ) (regarded as super smooth sets via example into $X$ , such that
1. for every plot $\mathbb{R}^n \times \mathbb{D} \to X$ there is a differentiably good open cover (def. ) restricted to which the plot factors through the $\mathbb{R}^{b\vert s}_i$ ;
2. every $\phi_i$ is a local diffeomorphism according to def. , now with respect not just to infinitesimally thickened points, but with respect to superpoints;
the bosonic part of $X$ is a smooth manifold according to def./prop. .

Finally we have the evident generalization of the smooth moduli space $\mathbf{\Omega}^\bullet$ of differential forms from example to supergeometry

Example

(universal smooth moduli spaces of super differential forms)

For $n \in \mathbf{M}$ write

\mathbf{\Omega}^n \;\in\; SuperSmoothSet

for the super smooth set (def. ) whose set of plots on a super Cartesian space $U \in SuperCartSp$ (def. ) is the set of super differential forms (def. ) of cohomolgical degree $n$

\mathbf{\Omega}^n(U) \;\coloneqq\; \Omega^n(U)

and whose maps of plots is given by pullback of super differential forms.

The de Rham differential on super differential forms applied plot-wise yields a morpism of super smooth sets

(27)

d \;\colon\; \mathbf{\Omega}^n \longrightarrow \mathbf{\Omega}^{n+1} \,.

As before in def. we then define for any super smooth set $X \in SuperSmoothSet$ its set of differential $n$ -forms to be

\Omega^n(X) \;\coloneqq\; Hom_{SuperSmoothSet}(X,\mathbf{\Omega}^n)

and we define the de Rham differential on these to be given by postcomposition with (27).

$\,$

Definition

(bosonic fields and fermionic fields)

For $\Sigma$ a spacetime, such as Minkowski spacetime (def. ) if a fiber bundle $E \overset{fb}{\longrightarrow} \Sigma$ with total space a super Cartesian space (def. ) (or more generally a supermanifold, def./prop. ) is regarded as a super-field bundle (def. ), then

the even-graded sections are called the bosonic field histories;
the odd-graded sections are called the fermionic field histories.

In components, if $E = \Sigma \times F$ is a trivial bundle with fiber a super Cartesian space (def. ) with even-graded coordinates $(\phi^a)$ and odd-graded coordinates $(\psi^A)$ , then the $\phi^a$ are called the bosonic field coordinates, and the $\psi^A$ are called the fermionic field coordinates.

What is crucial for the discussion of field theory is the following immediate supergeometric analog of the smooth structure on the space of field histories from example :

Example

(supergeometric space of field histories)

Let $E \overset{fb}{\to} \Sigma$ be a super-field bundle (def. , def. ).

Then the space of sections, hence the space of field histories, is the super formal smooth set (def. )

\Gamma_\Sigma(E) \in SuperSmoothSet

whose plots $\Phi_{(-)}$ for a given Cartesian space $\mathbb{R}^n$ and superpoint $\mathbb{D}$ (def. ) with the Cartesian products $U \coloneqq \mathbb{R}^n \times \mathbb{D}$ and $U \times \Sigma$ regarded as super smooth sets according to example are defined to be the morphisms of super smooth set (def. )

\array{ U \times \Sigma &\overset{\Phi_{(-)}(-)}{\longrightarrow}& E }

which make the following diagram commute:

\array{ && E \\ & {}^{\mathllap{\Phi_{(-)}(-)}}\nearrow & \downarrow^{\mathrlap{fb}} \\ U \times \Sigma &\underset{pr_2}{\longrightarrow}& \Sigma } \,.

Explicitly, if $\Sigma$ is a Minkowski spacetime (def. ) and $E = \Sigma \times F$ a trivial field bundle with field fiber a super vector space (example , example ) this means dually that a plot $\Phi_{(-)}$ of the super smooth set of field histories is a homomorphism of supercommutative superalgebras (def. )

\array{ C^\infty(U \times \Sigma) &\overset{\left(\Phi_{(-)}(-)\right)^\ast}{\longleftarrow}& C^\infty(E) }

which make the following diagram commute:

\array{ && C^\infty(E) \\ & {}^{\mathllap{\left( \Phi_{(-)}(-) \right)^\ast }}\nearrow & \uparrow^{\mathrlap{fb^\ast}} \\ C^\infty(U \times \Sigma) &\underset{pr_2^\ast}{\longleftarrow}& C^\infty(\Sigma) } \,.

We will focus on discussing the supergeometric space of field histories (example ) of the Dirac field (def. below). This we consider below in example ; but first we discuss now some relevant basics of general supergeometry.

Example is really a special case of a general relative mapping space-construction as in example . This immediately generalizes also to the supergeometric context.

Definition

(super-mapping space out of a super Cartesian space)

Let $X$ be a super Cartesian space (def. ) and let $Y$ be a super smooth set (def. ). Then the mapping space

[X,Y] \;\in\; SuperSmoothSet

of super smooth functions from $X$ to $Y$ is the super formal smooth set whose $U$ -plots are the morphisms of super smooth set from the Cartesian product of super Cartesian space $U \times X$ to $Y$ , hence the $U \times X$ -plots of $Y$ :

[X,Y](U) \;\coloneqq\; Y(U \times X) \,.

In direct generalization of the synthetic tangent bundle construction (example ) to supergeometry we have

Definition

(odd tangent bundle)

Let $X$ be a super smooth set (def. ) and $\mathbb{R}^{0 \vert 1}$ the superpoint (26) then the supergeometry-mapping space

\array{ T_{odd} X & \coloneqq& [\mathbb{R}^{0\vert 1}, X] \\ {}^{\mathllap{tb_{odd}}}\downarrow && \downarrow^{\mathrlap{ [ \ast \to \mathbb{R}^{0 \vert 1}, X ] }} \\ X & = & X }

is called the odd tangent bundle of $X$ .

Example

(mapping space of superpoints)

Let $V$ be a finite dimensional real vector space and consider its corresponding superpoint $V_{odd}$ from exampe . Then the mapping space (def. ) out of the superpoint $\mathbb{R}^{0\vert 1}$ (def. ) into $V_{odd}$ is the Cartesian product $V_{odd} \times V$

[\mathbb{R}^{0\vert 1}, V_{odd}] \;\simeq\; V_{odd} \times V \,.

By def. this says that $V_{odd} \times V$ is the “odd tangent bundle” of $V_{odd}$ .

Proof

Let $U$ be any super Cartesian space. Then by definition we have the following sequence of natural bijections of sets of plots

\begin{aligned} \left[ \mathbb{R}^{0\vert 1}, V_{odd} \right](U) & = Hom_{SuperSmoothSet}( \mathbb{R}^{0\vert 1} \times U, V_{odd} ) \\ & \simeq Hom_{\mathbb{R}sAlg}( \wedge^\bullet(V^\ast)\,,\, C^\infty(U)[\theta]/(\theta^2) ) \\ & \simeq Hom_{\mathbb{R}Vect}( V^\ast \,,\, (C^\infty(U)_{odd} \oplus C^\infty(U)_{even}\langle \theta\rangle ) \\ & \simeq Hom_{\mathbb{R}Vect}( V^\ast\,,\, C^\infty(U)_{odd} ) \,\times\, Hom_{\mathbb{R}Vect}( V^\ast, C^\infty(U)_{even} ) \\ & \simeq V_{odd}(U) \times V(U) \\ & \simeq (V_{odd} \times V)(U) \end{aligned}

Here in the third line we used that the Grassmann algebra $\wedge^\bullet V^\ast$ is free on its generators in $V^\ast$ , meaning that a homomorphism of supercommutative superalgebras out of the Grassmann algebra is uniquely fixed by the underlying degree-preserving linear function on these generators. Since in a Grassmann algebra all the generators are in odd degree, this is equivalently a linear map from $V^\ast$ to the odd-graded real vector space underlying $C^\infty(U)[\theta](\theta^2)$ , which is the direct sum $C^\infty(U)_{odd} \oplus C^\infty(U)_{even}\langle \theta \rangle$ .

Then in the fourth line we used that finite direct sums are Cartesian products, so that linear maps into a direct sum are pairs of linear maps into the direct summands.

That all these bijections are natural means that they are compatible with morphisms $U \to U'$ and therefore this says that $[\mathbb{R}^{0\vert 1}, V_{odd}]$ and $V_{odd} \times V$ are the same as seen by super-smooth plots, hence that they are isomorphic as super smooth sets.

With this supergeometry in hand we finally turn to defining the Dirac field species:

Example

(field bundle for Dirac field)

For $\Sigma$ being Minkowski spacetime (def. ), of dimension $2+1$ , $3+1$ , $5+1$ or $9+1$ , let $S$ be the spin representation from prop. , whose underlying real vector space is

S \;=\; \left\{ \array{ \mathbb{R}^2 \oplus \mathbb{R}^2 & \vert & p + 1 = 2+1 \\ \mathbb{C}^2 \oplus \mathbb{C}^2 &\vert& p + 1 = 3 + 1 \\ \mathbb{H}^2 \oplus \mathbb{H}^2 &\vert& p + 1 = 5 + 1 \\ \mathbb{O}^2 \oplus \mathbb{O}^2 &\vert& p + 1 = 9 + 1 } \right.

With

S_{odd} \simeq \mathbb{R}^{0 \vert dim(S)}

the corresponding superpoint (example ), then the field bundle for the Dirac field on $\Sigma$ is

E \;\coloneqq\; \Sigma \times S_{odd} \overset{pr_1}{\to} \Sigma \,,

hence the field fiber is the superpoint $S_{odd}$ . This is the corresponding spinor bundle on Minkowski spacetime, with fiber in odd super-degree.

The traditional two-component spinor basis from remark provides fermionic field coordinates (def. ) on the field fiber $S_{odd}$ :

\left( \psi^A \right)_{A = 1}^4 \;=\; \left( (\chi_a), (\xi^{\dagger \dot a}) \right)_{a,\dot a = 1,2} \,.

Notice that these are $\mathbb{K}$ -valued odd functions: For instance if $\mathbb{K} = \mathbb{C}$ then each $\chi_a$ in turn has two components, a real part and an imaginary part.

A key point with the field bundle of the Dirac field (example ) is that the field fiber coordinates $(\psi^A)$ or $\left((\chi_a), (\xi^{\dagger \dot a})\right)$ are now odd-graded elements in the function algebra on the field fiber, which is the Grassmann algebra $C^\infty(S_{odd}) = \wedge^\bullet(S^\ast)$ . Therefore they anti-commute with each other:

(28)

\psi^\alpha \psi^{\beta} = - \psi^{\beta} \psi^\alpha \,.

snippet grabbed from (Dermisek 09)

We analyze the special nature of the supergeometry space of field histories of the Dirac field a little (prop. ) below and conclude by highlighting the crucial role of supergeometry (remark below)

Proposition

(space of field histories of the Dirac field)

Let $E = \Sigma \times S_{odd} \overset{pr_1}{\to} \Sigma$ be the super-field bundle (def. ) for the Dirac field over Minkowski spacetime $\Sigma = \mathbb{R}^{p,1}$ from example .

Then the corresponding supergeometric space of field histories

\Gamma_\Sigma(\Sigma \times S_{odd}) \;\in\; SuperSmoothSet

from example has the following properties:

For $U = \mathbb{R}^n$ an ordinary Cartesian space (with no super-geometric thickening, def. ) there is only a single $U$ -parameterized collection of field histories, hence a single plot

$\Psi_{(-)}\;\colon\;\mathbb{R}^n \overset{ 0 }{\longrightarrow} \Gamma_\Sigma(\Sigma \times S_{odd})$

and this corresponds to the zero section, hence to the trivial Dirac field

$\Psi^A_{(-)} = 0 \,.$
For $U = \mathbb{R}^{n \vert 1}$ a super Cartesian space () with a single super-odd dimension, then $U$ -parameterized collections of field histories

$\Phi_{(-)} \;\colon\; \mathbb{R}^{n\vert 1} \longrightarrow \Gamma_\Sigma(\Sigma \times S_{odd})$

are in natural bijection with plots of sections of the bosonic-field bundle with field fiber $S_{even} = S$ the spin representation regarded as an ordinary vector space:

$\Psi_{(-)} \;\colon\; \mathbb{R}^n \longrightarrow \Gamma_\Sigma(\Sigma \times S_{even}) \,.$

Moreover, these two kinds of plots determine the fermionic field space completely: It is in fact isomorphic, as a super vector space, to the bosonic field space shifted to odd degree (as in example ):

\Gamma_\Sigma(\Sigma \times S_{odd}) \;\simeq\; \left( \Gamma_\Sigma(E\times S_{even}) \right)_{odd} \,.

Proof

In the first case, the plot is a morphism of super Cartesian spaces (def. ) of the form

\mathbb{R}^n \times \mathbb{R}^{p,1} \longrightarrow S_{odd} \,.

By definitions this is dually homomorphism of real supercommutative superalgebras

C^\infty(\mathbb{R}^n \times \mathbb{R}^{p,1}) \longleftarrow \wedge^\bullet S^\ast

from the Grassmann algebra on the dual vector space of the spin representation $S$ to the ordinary algebras of smooth functions on $\mathbb{R}^n \times \mathbb{R}^{p,1}$ . But the latter has no elements in odd degree, and hence all the Grassmann generators need to be send to zero.

For the second case, notice that a morphism of the form

\mathbb{R}^{n\vert 1} \overset{\Phi_{(-)}}{\longrightarrow} S_{odd}

is by def. naturally identified with a morphism of the form

\mathbb{R}^n \overset{\Psi_{(-)}}{\longrightarrow} [\mathbb{R}^{0 \vert 1}, S_{odd}] \simeq S_{odd} \times S_{even} \,,

where the identification on the right is from example .

By the nature of Cartesian products these morphisms in turn are naturally identified with pairs of morphisms of the form

\left( \array{ \mathbb{R}^n &\overset{}{\longrightarrow}& S_{odd}\,, \\ \mathbb{R}^n &\overset{}{\longrightarrow}& S_{even} } \right) \,.

Now, as in the first point above, here the first component is uniquely fixed to be the zero morphism $\mathbb{R}^n \overset{0}{\to} S_{odd}$ ; and hence only the second component is free to choose. This is precisely the claim to be shown.

Remark

(supergeometric nature of the Dirac field)

Proposition how two basic facts about the Dirac field, which may superficially seem to be in tension with each other, are properly unified by supergeometry:

On the one hand a field history $\Psi$ of the Dirac field is not an ordinary section of an ordinary vector bundle. In particular its component functions $\psi^A$ anti-commute with each other, which is not the case for ordinary functions, and this is crucial for the Lagrangian density of the Dirac field to be well defined, we come to this below in example .
On the other hand a field history of the Dirac field is supposed to be a spinor, hence a section of a spinor bundle, which is an ordinary vector bundle.

Therefore prop. serves to shows how, even though a Dirac field is not defined to be an ordinary section of an ordinary vector bundle, it is nevertheless encoded by such an ordinary section: One says that this ordinary section is a “superfield-component” of the Dirac field, the one linear in a Grassmann variable $\theta$ .

$\,$

This concludes our discussion of the concept of fields itself. In the following chapter we consider the variational calculus of fields.

Field variations

In this chapter we discuss these topics:

Jet bundles
Differential operators
Variational calculus and the Variational bicomplex

$\,$

Given a field bundle as in def. above, then we know what type of quantities the corresponding field histories assign to a given spacetime point (a given event). Among all consistent such field histories, some are to qualify as those that “may occur in reality” if we think of the field theory as a means to describe parts of the observable universe. Moreover, if the reality to be described does not exhibit “action at a distance” then admissibility of its field histories should be determined over arbitrary small spacetime regions, in fact over the infinitesimal neighbourhood of any spacetime point (remark below). This means equivalently that the realized field histories should be those that satisfy a given differential equation, namely an equation between the partial derivatives of the field history at any spacetime point. This is called the equation of motion of the field theory (def. below).

In order to formalize this, it is useful to first collect all the possible partial derivatives that a field history may have at any given point into one big space of “field derivatives at spacetime points”. This collection is called the jet bundle of the field bundle, given as def. below.

Moving around in this space means to change the possible value of fields and their derivatives, hence to vary the fields. Accordingly variational calculus of fields is just differential calculus on the jet bundle of the field bundle, this we consider in def. below.

$\,$

jet bundles

Definition

(jet bundle of a trivial vector bundle over Minkowski spacetime)

Given a field fiber super vector space $F = \mathbb{R}^{b\vert s}$ with linear basis $(\phi^a)$ , then for $k \in \mathbb{N}$ a natural number, the order- $k$ jet bundle

\array{ J^k_{\Sigma}( E ) \\ \downarrow^{\mathrlap{jb_k}} \\ \Sigma }

over Minkowski spacetime $\Sigma$ of the trivial vector bundle

E \coloneqq \Sigma \times F

is the super Cartesian space (def. ) which is spanned by coordinate functions to be denoted as follows:

\left( (x^\mu) \,,\, (\phi^a ) \,,\, ( \phi^a_{,\mu} ) \,,\, ( \phi^a_{,\mu_1\mu_2} ) \,,\, \cdots \,,\, ( \phi^a_{,\mu_1 \cdots \mu_k} ) \,,\, \cdots \right)

where the indices $\mu, \mu_1, \mu_2, \cdots$ range from 0 to $p$ , while the index $a$ ranges from $1$ to $b$ for the even field coordinates, and then from $b+1$ to $b+s$ for the odd-graded field coordinates and the lower indices are symmetric:

(29)

\phi^a_{\mu_1 \cdots \mu_{i} \cdots \mu_j \cdots \mu_k} \;=\; \phi^a_{\mu_1 \cdots \mu_{j} \cdots \mu_i \cdots \mu_k} \,.

In terms of these coordinates the bundle projection map $jb_k$ is just the one that remembers the spacetime coordinates $x^\mu$ and forgets the values of the field $\phi^a$ and its derivatives $\phi_{\mu}$ . Similarly there are intermediate projection maps

\array{ \cdots &\overset{jb_{3,2}}{\longrightarrow}& J^{2}_\Sigma(E) &\overset{jb_{2,1}}{\longrightarrow}& J^1_\Sigma(E) &\overset{jb_{1,0}}{\longrightarrow}& E \\ && &{}_{\mathllap{jb_2}}\searrow& {}^{\mathllap{jb_1}}\downarrow &\swarrow_{\mathrlap{fb}}& \\ && && \Sigma && }

given by forgetting coordinates with more indices.

The infinite-order jet bundle

J^\infty_\Sigma(E) \in SuperSmoothSet

is the direct limit of super smooth sets (def. ) over these finite order jet bundles. Explicitly this means that it is the smooth set which is defined by the fact that a smooth function (a plot, by prop. )

U \overset{f}{\longrightarrow} J^\infty_\Sigma(E)

from some super Cartesian space $U$ is equivalently a system of ordinary smooth functions into all the finite-order jet spaces

\left( U \overset{f_k}{\longrightarrow} J^k_\Sigma(E) \right)_{k \in \mathbb{N}} \,,

such that this system is compatible with the above projection maps, i.e. such that

\underset{k \in \mathbb{N}}{\forall} \left( jb_{k+1,k} \circ f_{k+1} = f_k \right) \phantom{AAAAAAA} \array{ && && U && \\ && & {}^{\mathllap{f_2}}\swarrow& {}_{\mathllap{f_1}}\downarrow &\searrow^{f_0}& \\ \cdots &\overset{jb_{3,2}}{\longrightarrow}& J^{2}_\Sigma(E) &\overset{jb_{2,1}}{\longrightarrow}& J^1_\Sigma(E) &\overset{jb_1}{\longrightarrow}& E \\ && &{}_{\mathllap{jb_2}}\searrow& {}^{\mathllap{jb_1}}\downarrow &\swarrow_{\mathrlap{fb}}& \\ && && \Sigma && }

The coordinate functions $\phi^a_{\mu_1 \cdots \mu_k}$ on a jet bundle (def. ) are to be thought of as partial derivatives $\frac{\partial}{\partial x^{\mu_1}} \cdots \frac{\partial}{\partial x^{\mu_k}} \Phi^a$ of components $\Phi^a$ of would-be field histories $\Phi$ . The power of the jet bundle is that it allows to disentangle relations between would-be partial derivatives of field history components in themselves from consideration of actual field histories. In traditional physics texts this is often done implicitly. We may make it fully explit by the operation of jet prolongation which reads in a field history and records all its partial derivatives in the form of a section of the jet bundle:

Definition

(jet prolongation)

Let $E \overset{fb}{\to} \Sigma$ be a field bundle (def. ) which happens to be a trivial vector bundle over Minkowski spacetime as in example .

There is a smooth function from the space of sections of $E$ , the space of field histories (example ) to the space of sections of the jet bundle $J^\infty_\Sigma(E) \overset{jb^\infty}{\to} \Sigma$ (def. ) which records the field $\Phi$ and all its spacetimes derivatives:

\array{ \Gamma_\Sigma(E) &\overset{j^\infty_\Sigma}{\longrightarrow}& \Gamma_\Sigma(J^\infty_\Sigma(E)) \\ (\Phi^a) &\mapsto& \left( \left( \Phi^a \right) \,,\, \left( \frac{\partial \Phi^a}{\partial x^\mu} \right) \,,\, \left( \frac{\partial^2 \Phi^a}{\partial x^{\mu_1} \partial x^{\mu_2}} \right) \,,\, \cdots \right) } \,.

This is called the operation of jet prolongation: $j^\infty_\Sigma(\Phi)$ is the jet prolongation of $\Phi$ .

Remark

(jet bundle in terms of synthetic differential geometry)

In terms of the infinitesimal geometry of formal smooth sets (def. ) the jet bundle $J^\infty_\Sigma(E) \overset{jb_\infty}{\to} \Sigma$ (def. ) of a field bundle $E \overset{fb}{\to}\Sigma$ has the following incarnation:

A section of the jet bundle over a point $x \in \Sigma$ of spacetime (an event), is equivalently a section of the original field bundle over the infinitesimal neighbourhood $\mathbb{D}_x$ of that point (example ):

\left\{ \array{ && J^\infty_\Sigma(E) \\ & \nearrow & \downarrow^{\mathrlap{jb_\infty}} \\ \{x\} &\hookrightarrow& \Sigma } \phantom{AA} \right\} \phantom{AA} \simeq \phantom{AA} \left\{ \array{ && E \\ & {}^{\mathllap{}}\nearrow & \downarrow^{\mathrlap{fb}} \\ \mathbb{D}_x &\hookrightarrow& \Sigma } \phantom{AA} \right\} \,.

Moreover, given a field history $\Phi$ , hence a section of the field bundle, then its jet prolongation $j^\infty(\Phi)$ (def. ) is that section of the jet bundle which under the above identification is simply the restriction of $\Phi$ to the infinitesimal neighbourhood of $x$ :

\array{ && E \\ & {}^{\mathllap{\Phi}}\nearrow & \downarrow^{\mathrlap{fb}} \\ \Sigma & = & \Sigma } \phantom{AAAA}\overset{j^\infty_\Sigma}{\mapsto} \phantom{AAAA} \array{ && J^\infty_\Sigma(E) \\ & {}^{\mathllap{j^\infty_\Sigma(\Phi)}}\nearrow & \downarrow^{\mathrlap{jb_\infty}} \\ \Sigma &=& \Sigma } \phantom{AAAA} \overset{(-)\vert_{\{x\}} }{\mapsto} \phantom{AAAA} \array{ && E \\ & {}^{\mathllap{\Phi\vert_{\mathbb{D}_x}}}\nearrow & \downarrow^{\mathrlap{fb}} \\ \mathbb{D}_x &\hookrightarrow& \Sigma } \,.

This follows with an argument as in example .

Hence in synthetic differential geometry we have:

The jet of a section $\Phi$ at $x$ is simply the restriction of that section to the infinitesimal neighbourhood of $x$ .

(Khavkine-Schreiber 17, section 3.3)

So the canonical coordinates on the jet bundle are the spacetime-point-wise possible values of fields and field derivates, while the jet prolongation picks the actual collections of field derivatives that may occur for an actual field history.

Example

(universal Faraday tensor/field strength on jet bundle)

Consider the field bundle (def. ) of the electromagnetic field (example ) over Minkowski spacetime $\Sigma$ (def. ), i.e. the cotangent bundle $E = T^\ast \Sigma$ (def. ) with jet coordinates $((x^\mu), (a_\mu), (a_{\mu,\nu}), \cdots )$ (def. ). Consider the functions on the jet bundle given by the linear combinations

(30)

\begin{aligned} f_{\mu \nu} & \coloneqq a_{[\nu,\mu]} \\ & \coloneqq \tfrac{1}{2}\left( a_{\nu,\mu} - a_{\mu,\nu} \right) \end{aligned}

of the first order jets.

Then for an electromagnetic field history (“vector potential”), hence a section

A \in \Gamma_\Sigma(T^\ast \Sigma) = \Omega^1(\Sigma)

with components $A^\ast (a_\mu) = A_\mu$ , its jet prolongation (def. )

j^\infty_\Sigma(A) \in \Gamma_\Sigma(J^\infty_\Sigma(T^\ast \Sigma))

has components

\left( (A_\mu), \left( \frac{d A_\mu}{d x^\nu} \right) , \cdots \right) \,.

The pullback of the functions $f_{\mu \nu}$ (30) along this jet prolongation are the components of the Faraday tensor of the field (20):

\begin{aligned} \left(j^\infty_\Sigma(A)\right)^\ast(f_{\mu \nu}) & = F_{\mu \nu} \\ & = (d A)_{\mu \nu} \,. \end{aligned}

More generally, for $\mathfrak{g}$ a Lie algebra and

E \coloneqq T^\ast \Sigma \otimes \mathfrak{g}

the field bundle for Yang-Mills theory from example , consider the functions

f^\alpha_{\mu \nu} \;\in \; \Omega^{0,0}_\Sigma(E) = C^\infty(J^\infty_\Sigma(E))

on the jet bundle given by

(31)

\begin{aligned} f^\alpha_{\mu \nu} & \coloneqq \tfrac{1}{2} \left( a^\alpha_{\nu,\mu} - a^\alpha_{\mu,\nu} + \gamma^{\alpha}{}_{\beta \gamma} a^\beta_{\mu} a^\gamma_{\nu} \right) \end{aligned}

where $(\gamma^\alpha{}_{\beta \gamma})$ are the structure constants of the Lie algebra as in (21), and where the square brackets around the indices denote anti-symmetrization.

We may call this the universal Yang-Mills field strength, being the covariant exterior derivative of the universal Yang-Mills field history.

For $\mathfrak{g} = \mathbb{R}$ the line Lie algebra and $k$ the canonical inner product on $\mathbb{R}$ the expression (31) reduces to the universal Faraday tensor (30) for the electromagnetic field (example ).

For $A \in \Gamma_\Sigma(T^\ast \Sigma \otimes \mathfrak{g}) = \Omega^1(\Sigma,\mathfrak{g})$ a field history of Yang-Mills theory, hence a Lie algebra-valued differential 1-form, then the value of this function on that field are called the components of the covariant exterior derivative or field strength

\begin{aligned} F_{\mu \nu} & \coloneqq A^\ast(D_{[\mu} a_{\nu]}) \\ & = (d_A A)_{\mu \nu} \end{aligned}

Example

(universal B-field strength on jet bundle)

Consider the field bundle (def. ) of the B-field (example ) over Minkowski spacetime $\Sigma$ (def. ) with jet coordinates $((x^\mu), (b_{\mu \nu}), (b_{\mu \nu,\rho}), \cdots )$ (def. ). Consider the functions on the jet bundle given by the linear combinations

(32)

\begin{aligned} h_{\mu_1 \mu_2 \mu_3} & \coloneqq \tfrac{1}{2} b_{[\mu_1 \mu_2, \mu_3]} \\ & \coloneqq \tfrac{1}{6} \left( \underset{ \sigma \atop \text{permutation} }{\sum} (-1)^{ {\vert \sigma \vert} } b_{\mu_{\sigma_1} \mu_{\sigma_2}, \mu_{\sigma_3}} \right) \\ & = b_{\mu_1 \mu_2, \mu_3} + b_{\mu_2 \mu_3, \mu_1} + b_{\mu_3 \mu_1, \mu_2} \,, \end{aligned}

where in the last step we used that $b_{\mu \nu} = - b_{\nu \mu}$ .

$\,$

While the jet bundle (def. ) is not finite dimensional, reflecting the fact that there are arbitrarily high orders of spacetime derivatives of a field histories, it turns out that it is only very “mildly infinite dimensional” in that smooth functions on jet bundles turn out to locally depend on only finitely many of the jet coordinates (i.e. only on a finite order of spacetime derivatives). This is the content of the following prop. .

This reflects the locality of Lagrangian field theory defined over jet bundles: If functions on the jet bundle could depend on infinitely many jet coordinates, then by Taylor series expansion of fields the function at one point over spacetime could in fact depend on field history values at a different point of spacetime. Such non-local dependence is ruled out by prop. below.

In practice this means that the situation is very convenient:

Any given local Lagrangian density (which will define a field theory, we come to this in def. below) will locally depend on some finite number $k$ of derivatives and may hence locally be treated as living on the ordinary manifold $J^k_\Sigma(E)$ .
while at the same time all formulas (such as for the Euler-Lagrange equations, def. ) work uniformly without worries about fixing a maximal order of derivatives.

Proposition

(jet bundle is a locally pro-manifold)

Given a jet bundle $J^\infty_\Sigma(E)$ as in def. , then a smooth function out of it

J^\infty_\Sigma(E) \longrightarrow X

is such that around each point of $J^\infty_\Sigma(E)$ there is a neighbourhood $U \subset J^\infty_\Sigma(E)$ on which it is given by a function on a smooth function on $J^k_\Sigma(E)$ for some finite $k$ .

(see Khavkine-Schreiber 17, section 2.2 and 3.3)

$\,$

differential operators

Example shows that the de Rham differential (def. ) may be encoded in terms of composing jet prolongation with a suitable function on the jet bundle. More generally, jet prolongation neatly encodes (possibly non-linear) differential operators:

Definition

(differential operator)

Let $E_1 \overset{fb_1}{\to} \Sigma$ and $E_2 \overset{fb_2}{\to} \Sigma$ be two smooth fiber bundles over a common base space $\Sigma$ . Then a (possibly non-linear) differential operator from sections of $E_1$ to sections of $E_2$ is a bundle morphism from the jet bundle of $E_1$ (def. ) to $E_2$ :

\array{ J^\infty_\Sigma(E_1) && \overset{\tilde D}{\longrightarrow} && E_2 \\ & \searrow && \swarrow \\ && \Sigma }

or rather the function $D$ between the spaces of sections of these bundles which this induces after composition with jet prolongation (def. ):

D \;\colon\; \Gamma_\Sigma(E_1) \overset{j^\infty_\Sigma}{\longrightarrow} \Gamma_\Sigma(J^\infty_\Sigma(E_1)) \overset{\tilde D \circ (-)}{\longrightarrow} \Gamma_\Sigma(E_2) \,.

If both $E_1$ and $E_2$ are vector bundles (def. ) so that their spaces of sections canonically are vector spaces, then $D$ is called a linear differential operator if it is a linear function between these vector spaces. This means equivalently that $\tilde D$ is a linear function in jet coordinates.

Definition

(normally hyperbolic differential operator on Minkowski spacetime)

Let $E \overset{fb}{\to} \Sigma$ be a field bundle (def. ) which is a vector bundle (def. ) over Minkowski spacetime (def. ). Write $E^\ast \overset{}{\to} \Sigma$ for its dual vector bundle (def. )

A linear differential operator (def. )

P \;\colon\; \Gamma_\Sigma(E) \longrightarrow \Gamma_{\Sigma}(E^\ast)

is of second order if it has a coordinate expansion of the form

(P \Phi)_a \;=\; P^{\mu \nu}_{a b} \frac{\partial^2 \Phi^b}{\partial x^\mu \partial x^\nu} + P^\mu_{a b} \frac{\partial \Phi^b}{\partial x^\mu} + P_{a b} \Phi^b

for $\{(P^{\mu \nu}_{a b}), (P^\mu_{a b}), P_{a b}\}$ smooth functions on $\Sigma$ .

This is called a normally hyperbolic differential operator if its principal symbol $(P^{\mu \nu}_{a b})$ is proportional to the inverse Minkowski metric (prop./def. ) $(\eta^{\mu \nu})$ , i.e.

P^{\mu \nu}_{a b} = \eta^{\mu \nu} Q_{a b} \,.

Definition

(formally adjoint differential operators)

Let $E \overset{fb}{\to} \Sigma$ be a smooth vector bundle (def. ) over Minkowski spacetime $\Sigma = \mathbb{R}^{p,1}$ (def. ) and write $E^\ast \to \Sigma$ for the dual vector bundle (def. ).

Then a pair of linear differential operators (def. ) of the form

P, P^\ast \;\colon\; \Gamma_\Sigma(E_1) \longrightarrow \Gamma_\Sigma(E^\ast)

are called formally adjoint differential operators via a bilinear differential operator

(33)

K \;\colon\; \Gamma_\Sigma(E) \otimes \Gamma_\Sigma(E) \longrightarrow \Gamma_\Sigma(\wedge^{p} T^\ast \Sigma)

with values in differential p-forms (def. ) such that for all sections $\Phi_1, \Phi_2 \in \Gamma_\Sigma(E)$ we have

\left( P(\Phi_1) \cdot \Phi_2 - \Phi_1 \cdot P^\ast(\Phi_2) \right) dvol_\Sigma \;=\; d K(\Phi_1, \Phi_2) \,,

where $dvol_\Sigma$ is the volume form on Minkowski spacetime (10) and where $d$ denoted the de Rham differential (def. ).

This implies by Stokes' theorem (prop. ) in the case of compact support that under an integral $P$ and $P^\ast$ are related via integration by parts.

(Khavkine 14, def. 2.4)

$\,$

variational calculus and the variational bicomplex

Remark

(variational calculus – replacing plain bundle morphisms by differential operators)

Various concepts in variational calculus, especially the concept of evolutionary vector fields (def. below) and gauge parameterized implicit infinitesimal gauge symmetries (def. below) follow from concepts in plain differential geometry by systematically replacing plain bundle morphisms by bundle morphisms out of the jet bundle, hence by differential operators $\tilde D$ as in def. .

Definition

(variational derivative and total spacetime derivative – the variational bicomplex)

On the jet bundle $J^\infty_\Sigma(E)$ of a trivial super vector space-vector bundle over Minkowski spacetime as in def. we may consider its de Rham complex of super differential forms (def. ); we write its de Rham differential (def. ) in boldface:

d \;\colon\; \Omega^\bullet(J^\infty_\Sigma(E)) \longrightarrow \Omega^{\bullet+1}(J^\infty_\Sigma(E)) \,.

Since the jet bundle unifies spacetime with field values, we want to decompose this differential into a contribution coming from forming the total derivatives of fields along spacetime (“horizontal derivatives”), and actual variation of fields at a fixed spacetime point (“vertical derivatives”):

The total spacetime derivative or horizontal derivative on $J^\infty_\Sigma(E)$ is the map on differential forms on the jet bundle of the form

d \;\colon\; \Omega^\bullet( J^\infty_\Sigma(E) ) \longrightarrow \Omega^{\bullet+1}( J^\infty_\Sigma(E) )

which on functions $f \colon J^\infty_\Sigma(E) \to \mathbb{R}$ (i.e. on 0-forms) is defined by

(34)

\begin{aligned} d f & \coloneqq \frac{d f}{d x^\mu} \mathbf{d} x^\mu \\ & \coloneqq \left( \frac{\partial f}{\partial x^\mu} + \frac{\partial f}{\partial \phi^a} \phi^a_{,\mu} + \frac{ \partial f }{ \partial \phi^a_{,\nu}} \phi^a_{,\nu \mu } + \cdots \right) \mathbf{d} x^\mu \end{aligned}

and extended to all forms by the graded Leibniz rule, hence as a nilpotent derivation of degree +1.

The variational derivative or vertical derivative

(35)

\delta \;\colon\; \Omega^\bullet( J^\infty_\Sigma(E) ) \longrightarrow \Omega^{\bullet+1}( J^\infty_\Sigma(E) )

is what remains of the full de Rham differential when the total spacetime derivative (horizontal derivative) is subtracted:

(36)

\delta \coloneqq \mathbf{d} - d \,.

We may then extend the horizontal derivative from functions on the jet bundle to all differential forms on the jet bundle by declaring that

d \circ \mathbf{d} \;\coloneqq\; - \mathbf{d} \circ d

which by (36) is equivalent to

(37)

d \circ \;\delta\; = - \delta \circ d \,.

For example

\begin{aligned} d \delta \phi & = - \delta d \phi \\ & = - \delta \left( \phi_{,\mu} d x^\mu \right) \\ & = - \delta \phi_{,\mu} \wedge d x^\mu \,. \end{aligned}

This defines a bigrading on the de Rham complex of $J^\infty_\Sigma(E)$ , into horizontal degree $r$ and vertical degree $s$

\Omega^\bullet\left( J^\infty_\Sigma(E) \right) \;\coloneqq\; \underset{r,s}{\oplus} \Omega^{r,s}(E)

such that the horizontal and vertical derivative increase horizontal or vertical degree, respectively:

(38)

\array{ C^\infty(J^\infty_\Sigma(E)) = & \Omega^{0,0}(E) &\overset{d}{\longrightarrow}& \Omega^{1,0}_\Sigma(E) &\overset{d}{\longrightarrow}& \Omega^{2,0}_\Sigma(E) &\overset{d}{\longrightarrow}& \cdots &\overset{d}{\longrightarrow}& \Omega^{p+1,0}_\Sigma(E) \\ & \downarrow^{\mathrlap{\delta}} && \downarrow^{\mathrlap{\delta}} && \downarrow^{\mathrlap{\delta}} && \cdots && \downarrow^{\mathrlap{\delta}} \\ & \Omega^{0,1}_\Sigma(E) &\overset{d}{\longrightarrow}& \Omega^{1,1}_\Sigma(E) &\overset{d}{\longrightarrow}& \Omega^{2,1}_\Sigma(E) &\overset{d}{\longrightarrow}& \cdots &\overset{d}{\longrightarrow}& \Omega^{p+1,1}_\Sigma(E) \\ & \downarrow^{\mathrlap{\delta}} && \downarrow^{\mathrlap{\delta}} && \downarrow^{\mathrlap{\delta}} && \cdots && \downarrow^{\mathrlap{\delta}} \\ & \Omega^{0,2}(E) &\overset{d}{\longrightarrow}& \Omega^{1,2}(E) &\overset{d}{\longrightarrow}& \Omega^{2,2}(E) &\overset{d}{\longrightarrow}& \cdots &\overset{d}{\longrightarrow}& \Omega^{p+1,2}_\Sigma(E) \\ & \downarrow^{\mathrlap{\delta}} && \downarrow^{\mathrlap{\delta}} && \downarrow^{\mathrlap{\delta}} && \cdots && \downarrow^{\mathrlap{\delta}} \\ & \vdots && \vdots && \vdots } \,.

This is called the variational bicomplex.

Accordingly we will refer to the differential forms on the jet bundle often as variational differential forms.

$\,$

derivatives on jet bundle

def.	symbols	name in physics	name in mathematics
def.	$\; \mathbf{d}$	de Rham differential	de Rham differential
	$\; d \coloneqq d x^\mu \frac{d}{d x^\mu}$	total spacetime derivative	horizontal derivative
	$\; \frac{d}{d x^\mu} \coloneqq \frac{\partial}{\partial x^\mu} + \phi^a_{,\mu} \frac{\partial}{\partial \phi^a} + \cdots$	total spacetime derivative along $\partial_\mu$	horizontal derivative along $\partial_\mu$
	$\; \delta \coloneqq \mathbf{d} - d$	variational derivative	vertical derivative
	$\; \delta_{EL} \mathbf{L} \coloneqq \mathbf{d}\mathbf{L} + d \Theta_{BFV}$	Euler-Lagrange variation	Euler-Lagrange operator
	$\; s_{BV}$	BV-differential	Koszul differential
	$\; s_{BRST}$	BRST differential	Chevalley-Eilenberg differential
	$\; s$	BV-BRST differential	Chevalley-Eilenberg-Koszul-Tate differential
	$\; s - d$	local BV-BRST differential

$\,$

Example

(basic facts about variational calculus)

Given the jet bundle of a field bundle as in def. , then in its variational bicomplex (def. ) we have the following:

The spacetime total derivative (horizontal derivative) of a spacetime coordinate function $x^\mu$ coincides with its ordinary de Rham differential

$\begin{aligned} d x^\mu & = \frac{\partial x^\mu}{ \partial x^\nu} \mathbf{d}x^\nu \\ & = \mathbf{d} x^\mu \end{aligned}$

which hence is a horizontal 1-form

$\mathbf{d}x^\mu \;\in\; \Omega^{1,0}_\Sigma(E) \,.$
Therefore the variational derivative (vertical derivative) of a spacetime coordinate function vanishes:

(39) $\delta x^\mu = 0 \,,$

reflective the fact that $x^\mu$ is not a field coordinate that could be varied.
In particular the given volume form on $\Sigma$ gives a horizontal $p+1$ -form on the jet bundle, which has the same coordinate expression (and which we denote by the same symbol)

$dvol_\Sigma = d x^0 \wedge d x^1 \wedge \cdots \wedge d x^p \;\in\; \Omega^{p+1,0} \,.$
Generally any horizontal $k$ -form is of the form

$f_{\mu_1 \cdots \mu_k} d x^{\mu_1} \wedge \cdots \wedge d x^{\mu_k} \;\in\; \Omega^{k,0}_{\Sigma}(E)$

for

$f_{\mu_1 \cdots \mu_k} = f_{\mu_1 \cdots \mu_k}\left((x^\mu), (\phi^a), (\phi^a_{,\mu}), \cdots\right) \in C^\infty(J^\infty_\Sigma(E))$

any smooth function of the spacetime coordinates and the field coordinates (locally depending only on a finite order of these, by prop. ).
In particular every horizontal $(p+1)$ -form $\mathbf{L} \in \Omega^{p+1,0}(E)$ is proportional to the above volume form

$\mathbf{L} = L \, dvol_\Sigma$

for $L = L((x^\mu), (\phi^a), (\phi^a_{,\mu}), \cdots)$ some smooth function that may depend on all the spacetime and field coordinates.
The spacetimes total derivatives /horizontal derivatives) of the variational derivative (vertical derivative) $\delta \phi$ of a field variable is the differential 2-form of horizontal degree 1 and vertical degree 1 given by

$\begin{aligned} d (\delta \phi^a) & = - \delta (d \phi_a) \\ & = - (\delta \phi^a_{,\mu}) \wedge \mathbf{d} x^\mu \end{aligned} \,.$

In words this says that “the spacetime derivative of the variation of the field is the variation of its spacetime derivative”.

The following are less trivial properties of variational differential forms:

Proposition

(pullback along jet prolongation compatible with total spacetime derivatives)

Let $E \overset{fb}{\to} \Sigma$ be a field bundle over a spacetime $\Sigma$ (def. ), with induced jet bundle $J^\infty_\Sigma(E)$ (def. ).

Then for $\Phi \in \Gamma_\Sigma(E)$ any field history, the pullback of differential forms (def. )

j^\infty_\Sigma(\Phi)^\ast \;\colon\; \Omega^\bullet(J^\infty_\Sigma(E)) \longrightarrow \Omega^\bullet(\Sigma)

along the jet prolongation of $\Phi$ (def. )

intertwines the de Rham differential on spacetime (def. ) with the total spacetime derivative (horizontal derivative) on the jet bundle (def. ):

$d \circ j^\infty_\Sigma(\Phi)^\ast \;=\; j^\infty_\Sigma(\Phi)^\ast \circ d \,.$
annihilates all vertical differential forms (def. ):

$j^\infty_\Sigma(\Phi)^\ast\vert_{\Omega^{r, \geq 1}_\Sigma(E)} = 0 \,.$

Proof

The operation of pullback of differential forms along any smooth function intertwines the full de Rham differentials (prop. ). In particular we have that

d \circ j^\infty_\Sigma(\Phi)^\ast = j^\infty_\Sigma(\Phi)^\ast \circ \mathbf{d} \,.

This means that the second statement immediately follows from the first, by definition of the variational (vertical) derivative as the difference between the full de Rham differential and the horizontal one:

\begin{aligned} j^\infty_\Sigma(\Phi)^\ast \circ \delta & = j^\infty_\Sigma(\Phi)^\ast \circ (\mathbf{d} - d) \\ & = (d - d) \circ j^\infty_\Sigma(\Phi)^\ast \\ & = 0 \end{aligned}

It remains to see the first statement:

Since the jet prolongation $j^\infty_\Sigma(\Phi)$ preserves the spacetime coordinates $x^\mu$ (being a section of the jet bundle) it is immediate that the claimed relation is satisfied on the horizontal basis 1-forms $\mathbf{d}x^\mu = d x^\mu$ (example ):

d j^\infty_\Sigma(\Phi)^\ast( \mathbf{d}x^\mu ) = d^2 x^\mu = 0 \phantom{AAAAA} j^\infty_\Sigma(\Phi)^\ast d \mathbf{d} x^\mu = j^\infty_\Sigma(\Phi)^\ast d^2 x^\mu \,.

Therefore it finally remains only to check the first statement on smooth functions (0-forms). So let

f = f\left( (x^\mu) \,,\, (\phi^a) \,,\, ( \phi^a_{,\mu} ) \,,\, \cdots \right)

be a smooth function on the jet bundle. Then by the chain rule

\begin{aligned} d j^\infty_\Sigma(\Phi)^\ast f\left( (x^\mu) \,,\, (\phi^a) \,,\, ( \phi^a_{,\mu} ) \,,\, \cdots \right) & = d f\left( (x^\mu) \,,\, (\Phi^a) \,,\, \left( \frac{\partial \Phi^a}{\partial x^\mu} \right) \,,\, \cdots \right) \\ & = \left( \frac{\partial f}{\partial x^\mu} + \frac{\partial f}{\partial \phi^a} \frac{\partial \Phi^a}{\partial x^\mu} + \frac{\partial f}{\partial \phi^a_{,\nu}} \frac{\partial^2 \Phi^a}{\partial x^\nu \partial x^\mu} + \cdots \right) d x^\mu \end{aligned}

That this is equal to $j^\infty_\Sigma(\Phi)^\ast d f$ follows by the very definition of the total spacetime derivative of $f$ (34).

Proposition

(horizontal variational complex of trivial field bundle is exact)

Let $E \overset{fb}{\to} \Sigma$ be a field bundle which is a trivial vector bundle over Minkowski spacetime (example ). Then the chain complex of horizontal differential forms $\Omega^{s,0}_\Sigma(E)$ with the total spacetime derivative (horizontal derivative) $d$ (def. )

(40)

\mathbb{R} \overset{}{\hookrightarrow} \Omega^{0,0}_\Sigma(E) \overset{d}{\longrightarrow} \Omega^{1,0}_\Sigma(E) \overset{d}{\longrightarrow} \Omega^{2,0}_\Sigma(E) \overset{d}{\longrightarrow} \cdots \overset{d}{\longrightarrow} \Omega^{p,0}_\Sigma(E) \overset{d}{\longrightarrow} \Omega^{p+1,0}_\Sigma(E)

is exact: for all $0 \leq s \leq p$ the kernel of $d$ coincides with the image of $d$ in $\Omega^{s,0}_\Sigma(E)$ .

More explicitly, this means that not only is every horizontally exact differential form $\omega = d \alpha$ horizontally closed $d \omega = 0$ (which follows immediately from the fact that we have a cochain complex in the first place, hence that $d^2 = 0$ ), but, conversely, if $\omega \in \Omega^{0 \leq s \leq p,0}_\Sigma(E)$ satisfies $d \omega = 0$ , then there exists $\alpha \in \Omega^{s-1,0}_\Sigma(E)$ with $\omega = d \alpha$ .

(e.g. Anderson 89, prop. 4.3)

Remark

(Euler-Lagrange complex)

In fact the exact sequence (40) from prop. continues further to the right, as such called the Euler-Lagrange complex. The next differential is the Euler-Lagrange operator and then then next is the Helmholtz operator.

Here we do not discuss this in detail, but we encounter aspects of the exactness further to the right below in example and in prop. .

$\,$

This concludes our discussion of variational calculus on the jet bundle of the field bundle. In the next chapter we apply this to Lagrangian densities on the jet bundle, defining Lagrangian field theories.

Lagrangians

In this chapter we discuss the following topics:

Lagrangian densities
Euler-Lagrange forms and Presymplectic currents
Euler-Lagrange equations of motion

$\,$

Given any type of fields (def. ), those field histories that are to be regarded as “physically realizable” (if we think of the field theory as a description of the observable universe) should satisfy some differential equation – the equation of motion – meaning that realizability of any field histories may be checked upon restricting the configuration to the infinitesimal neighbourhoods (example ) of each spacetime point. This expresses the physical absence of “action at a distance” and is one aspect of what it means to have a local field theory. By remark this means that equations of motion of a field theory are equations among the coordinates of the jet bundle of the field bundle.

For many field theories of interest, their differential equation of motion is not a random partial differential equations, but is of the special kind that exhibits the “principle of extremal action” (prop. below) determined by a local Lagrangian density (def. below). These are called Lagrangian field theories, and this is what we consider here.

Namely among all the variational differential forms (def. ) two kinds stand out, namley the 0-forms in $\Omega^{0,0}_\Sigma(E)$ – the smooth functions – and the horizontal $p+1$ -forms $\Omega^{p+1,0}_\Sigma(E)$ – to be called the Lagrangian densities $\mathbf{L}$ (def. below) – since these occupy the two “corners” of the variational bicomplex (38). There is not much to say about the 0-forms, but the Lagrangian densities $\mathbf{L}$ do inherit special structure from their special position in the variational bicomplex:

Their variational derivative $\delta \mathbf{L}$ uniquely decomposes as

the Euler-Lagrange derivative $\delta_{EL} \mathbf{L}$ which is proportional to the variation of the fields (instead of their derivatives)
the total spacetime derivative $d \Theta_{BFV}$ of a potential $\Theta_{BFV}$ for a presymplectic current $\Omega_{BFV} \coloneqq \delta \Theta_{BFV}$ .

This is prop. below:

\delta \mathbf{L} \;=\; \underset{ \text{Euler-Lagrange variation} }{\underbrace{\delta_{EL}\mathbf{L}}} - d \underset{\text{presymplectic current}}{\underbrace{\Theta_{BFV}}} \,.

These two terms play a pivotal role in the theory: The condition that the first term vanishes on field histories is a differential equation on field histories, called the Euler-Lagrange equation of motion (def. below). The space of solutions to this differential equation, called the on-shell space of field histories

(41)

\Gamma_\Sigma(E)_{\delta_{EL}\mathbf{L} = 0} \overset{\phantom{AAA}}{\hookrightarrow} \Gamma_\Sigma(E)

has the interpretation of the space of “physically realizable field histories”. This is the key object of study in the following chapters. Often this is referred to as the space of classical field histories, indicating that this does not yet reflect the full quantum field theory.

Indeed, there is also the second term in the variational derivative of the Lagrangian density, the presymplectic current $\Theta_{BFV}$ , and this implies a presymplectic structure on the on-shell space of field histories (def. below) which encodes deformations of the algebra of smooth functions on $\Gamma_\Sigma(E)$ . This deformation is the quantization of the field theory to an actual quantum field theory, which we discuss below.

\array{ &&& \delta \mathbf{L} \\ &&& = \\ & & \delta_{EL}\mathbf{L} &- & d \Theta_{BFV} & \\ & \swarrow && && \searrow \\ \array{ \text{classical} \\ \text{field theory} } && && && \array{ \text{deformation to} \\ \text{quantum} \\ \text{field theory} } }

$\,$

Lagrangian densities

Definition

(local Lagrangian density)

Given a field bundle $E$ over a $(p+1)$ -dimensional Minkowski spacetime $\Sigma$ as in example , then a local Lagrangian density $\mathbf{L}$ (for the type of field thus defined) is a horizontal differential form of degree $(p+1)$ (def. ) on the corresponding jet bundle (def. ):

\mathbf{L} \;\in \; \Omega^{p+1,0}_{\Sigma}(E) \,.

By example in terms of the given volume form on spacetimes, any such Lagrangian density may uniquely be written as

(42)

\mathbf{L} = L \, dvol_\Sigma

where the coefficient function (the Lagrangian function) is a smooth function on the spacetime and field coordinates:

L = L((x^\mu), (\phi^a), (\phi^a_{,\mu}), \cdots ) \,.

where by prop. $L((x^\mu), \cdots)$ depends locally on an arbitrary but finite order of derivatives $\phi^a_{,\mu_1 \cdots \mu_k}$ .

We say that a field bundle $E \overset{fb}{\to} \Sigma$ (def. ) equipped with a local Lagrangian density $\mathbf{L}$ is (or defines) a prequantum Lagrangian field theory on the spacetime $\Sigma$ .

Remark

(parameterized and physical unit-less Lagrangian densities)

More generally we may consider parameterized collections of Lagrangian densities, i.e. functions

\mathbf{L}_{(-)} \;\colon\; U \longrightarrow \Omega^{p+1,0}_\Sigma(E)

for $U$ some Cartesian space or generally some super Cartesian space.

For example all Lagrangian densities considered in relativistic field theory are naturally smooth functions of the scale of the metric $\eta$ (def. )

\array{ \mathbb{R}_{\gt 0} &\overset{}{\longrightarrow}& \Omega^{p+1,0}_\Sigma(E) \\ r &\mapsto& \mathbf{L}_{r^2\eta} }

But by the discussion in remark , in physics a rescaling of the metric is interpreted as reflecting but a change of physical units of length/distance. Hence if a Lagrangian density is supposed to express intrinsic content of a physical theory, it should remain unchanged under such a change of physical units.

This is achieved by having the Lagrangian be parameterized by further parameters, whose corresponding physical units compensate that of the metric such as to make the Lagrangian density “physical unit-less”.

This means to consider parameter spaces $U$ equipped with an action of the multiplicative group $\mathbb{R}_{\gt 0}$ of positive real numbers, and parameterized Lagrangians

\mathbf{L}_{(-)} \;\colon\; U \longrightarrow \Omega^{p+1,0}_\Sigma(E)

which are invariant under this action.

Remark

(locally variational field theory and Lagrangian p-gerbe connection)

If the field bundle (def. ) is not just a trivial vector bundle over Minkowski spacetime (example ) then a Lagrangian density for a given equation of motion may not exist as a globally defined differential $(p+1)$ -form, but only as a p-gerbe connection. This is the case for locally variational field theories such as the charged particle, the WZW model and generally theories involving higher WZW terms. For more on this see the exposition at Higher Structures in Physics.

Example

(local Lagrangian density for free real scalar field on Minkowski spacetime)

Consider the field bundle for the real scalar field from example , i.e. the trivial line bundle over Minkowski spacetime.

According to def. its jet bundle $J^\infty_\Sigma(E)$ has canonical coordinates

\left\{ \{x^\mu\}, \phi, \{\phi_{,\mu}\}, \{\phi_{,\mu_1 \mu_2}\}, \cdots \right\} \,.

In these coordinates, the local Lagrangian density $L \in \Omega^{p+1,0}(\Sigma)$ (def. ) defining the free real scalar field of mass $m \in \mathbb{R}$ on $\Sigma$ is

L \coloneqq \tfrac{1}{2} \left( \eta^{\mu \nu} \phi_{,\mu} \phi_{,\nu} - m^2 \phi^2 \right) \mathrm{dvol}_\Sigma \,.

This is naturally thought of as a collection of Lagrangians smoothly parameterized by the metric $\eta$ and the mass $m$ . For this to be physical unit-free in the sense of remark the physical unit of the parameter $m$ must be that of the inverse metric, hence must be an inverse length according to remark This is the inverse Compton wavelength $\ell_m = \hbar / m c$ (9) and hence the physical unit-free version of the Lagrangian density for the free scalar particle is

\mathbf{L}_{\eta,\ell_m} \:\coloneqq\; \tfrac{\ell_m^2}{2} \left( \eta^{\mu \nu} \phi_{,\mu} \phi_{,\nu} - \left( \tfrac{m c}{\hbar} \right)^2 \phi^2 \right) \mathrm{dvol}_\Sigma \,.

Example

(phi^n theory)

Consider the field bundle for the real scalar field from example , i.e. the trivial line bundle over Minkowski spacetime. More generally we may consider adding to the free field Lagrangian density from example some power of the field coordinate

\mathbf{L}_{int} \;\coloneqq\; g \phi^n \, dvol_\Sigma \,,

for $g \in \mathbb{R}$ some number, here called the coupling constant.

The interacting Lagrangian field theory defined by the resulting Lagrangian density

\mathbf{L} + \mathbf{L}_{int} \;=\; \tfrac{1}{2} \left( \eta^{\mu \nu} \phi_{,\mu} \phi_{,\nu} - m^2 \phi^2 + g \phi^n \right) \mathrm{dvol}_\Sigma

is usually called just phi^n theory.

Example

(local Lagrangian density for free electromagnetic field)

Consider the field bundle $T^\ast \Sigma \to \Sigma$ for the electromagnetic field on Minkowski spacetime from example , i.e. the cotangent bundle, which over Minkowski spacetime happens to be a trivial vector bundle of rank $p+1$ . With fiber coordinates taken to be $(a_\mu)_{\mu = 0}^p$ , the induced fiber coordinates on the corresponding jet bundle $J^\infty_\Sigma(T^\ast \Sigma)$ (def. ) are $( (x^\mu), (a_\mu), (a_{\mu,\nu}), (a_{\mu,\nu_1 \nu_2}), \cdots )$ .

Consider then the local Lagrangian density (def. ) given by

(43)

\mathbf{L} \;\coloneqq\; \tfrac{1}{2} f_{\mu \nu} f^{\mu \nu} dvol_\Sigma \;\in\; \Omega^{p+1,0}_\Sigma(T^\ast \Sigma) \,,

where $f_{\mu \nu} \coloneqq \tfrac{1}{2}(a_{\nu,\mu} - a_{\mu,\nu})$ are the components of the universal Faraday tensor on the jet bundle from example .

This is the Lagrangian density that defines the Lagrangian field theory of free electromagnetism.

Here for $A \in \Gamma_\Sigma(T^\ast \Sigma)$ an electromagnetic field history (vector potential), then the pullback of $f_{\mu \nu}$ along its jet prolongation (def. ) is the corresponding component of the Faraday tensor (20):

\begin{aligned} \left( j^\infty_\Sigma(A) \right)^\ast(f_{\mu \nu}) & = (d A)_{\mu \nu} \\ & = F_{\mu \nu} \end{aligned}

It follows that the pullback of the Lagrangian (43) along the jet prologation of the electromagnetic field is

\begin{aligned} \left( j^\infty_\Sigma(A) \right)^\ast \mathbf{L} & = \tfrac{1}{2} F_{\mu \nu} F^{\mu \nu} dvol_\Sigma \\ & = \tfrac{1}{2} F \wedge \star_\eta F \end{aligned}

Here $\star_\eta$ denotes the Hodge star operator of Minkowski spacetime.

More generally:

Example

(Lagrangian density for Yang-Mills theory on Minkowski spacetime)

Let $\mathfrak{g}$ be a finite dimensional Lie algebra which is semisimple. This means that the Killing form invariant polynomial

k \colon \mathfrak{g} \otimes \mathfrak{g} \longrightarrow \mathbb{R}

is a non-degenerate bilinear form. Examples include the special unitary Lie algebras $\mathfrak{so}(n)$ .

Then for $E = T^\ast \Sigma \otimes \mathfrak{g}$ the field bundle for Yang-Mills theory as in example , the Lagrangian density (def. ) $\mathfrak{g}$ -Yang-Mills theory on Minkowski spacetime is

\mathbf{L} \;\coloneqq\; \tfrac{1}{2} k_{\alpha \beta} f^\alpha_{\mu \nu} f^{\beta \mu \nu} \, dvol_\Sigma \;\in\; \Omega^{p+1,0}_\Sigma(T^\ast \Sigma) \,,

where

f^\alpha_{\mu \nu} \;\coloneqq\; \tfrac{1}{2} \left( a^\alpha_{\nu,\mu} - a^\alpha_{\mu,\nu} + \gamma^{\alpha}{}_{\beta \gamma} a^\beta_{\mu} a^\gamma_{\nu} \right) \;\in\; \Omega^{0,0}_\Sigma(E)

is the universal Yang-Mills field strength (31).

For the purposes of perturbative quantum field theory (to be discussed below in chapter 15. Interacting quantum fields) we may allow for a rescaling of the structure constants by (at this point) a real number $g$ , to be called the coupling constant, and decompose the Lagrangian into a sum of a free field theory Lagrangian (def. ) and an interaction term:

\begin{aligned} \mathbf{L} & = \tfrac{1}{2} k_{\alpha \beta} \tfrac{1}{2} \left( a^\alpha_{\nu,\mu} - a^\alpha_{\mu,\nu} + g \gamma^{\alpha}{}_{\beta' \gamma'} a^{\beta'}_{\mu} a^{\gamma'}_{\nu} \right) \tfrac{1}{2} \left( a^{\beta\nu,\mu} - a^{\beta \mu,\nu} + g \gamma^{\beta}{}_{\beta'' \gamma''} a^{\beta''}_{\mu} a^{\gamma''}_{\nu} \right) \,dvol_\Sigma \\ & = \underset{ \mathbf{L}_{\mathrm{free}} }{ \underbrace{ \tfrac{1}{2} k_{\alpha \beta} \tfrac{1}{2} \left( a^\alpha_{\nu,\mu} - a^\alpha_{\mu,\nu} \right) \tfrac{1}{2} \left( a^{\beta\nu,\mu} - a^{\beta \mu,\nu} \right) \,dvol_\Sigma } } \\ & \phantom{=} + \underset{ \mathbf{L}_{int} }{ \underbrace{ g \, k_{\alpha \beta} \tfrac{1}{2} \left( a^\alpha_{\nu,\mu} - a^\alpha_{\mu,\nu} \right) \tfrac{1}{2} \left( \gamma^{\beta}{}_{\beta'' \gamma''} a^{\beta''}_{\mu} a^{\gamma''}_{\nu} \right) \,dvol_\Sigma \; + \; g^2 \, \tfrac{1}{2} k_{\alpha \beta} \tfrac{1}{2} \left( \gamma^{\alpha}{}_{\beta' \gamma'} a^{\beta'}_{\mu} a^{\gamma'}_{\nu} \right) \tfrac{1}{2} \left( \gamma^{\beta}{}_{\beta'' \gamma''} a^{\beta''}_{\mu} a^{\gamma''}_{\nu} \right) \,dvol_\Sigma } } \\ \end{aligned} \,,

Notice that $\mathbf{L}_{free}$ is equivalently a sum of $dim(\mathfrak{g})$ -copies of the Lagrangian for the electromagnetic field (example ).

On the other hand, for the purpose of exhibiting “non-perturbative effects due to instantons” in Yang-Mills theory, one consider the rescaled Yang-Mills field coordinate

\tilde a^\alpha_\mu \;\coloneqq\; \frac{1}{g} a^\alpha_\mu

with corresponding field strength

\tilde f^\alpha_{\mu \nu} \;\coloneqq\; \tfrac{1}{2} \left( \tilde a^\alpha_{\nu,\mu} - \tilde a^\alpha_{\mu,\nu} + \gamma^{\alpha}{}_{\beta \gamma} \tilde a^\beta_{\mu} \tilde a^\gamma_{\nu} \right) \;\in\; \Omega^{0,0}_\Sigma(E) \,.

In terms of this the expression for the Lagrangian is brought back to the abstract form it had before rescaling the structure constants by the coupling constant, up to a global rescaling of all terms by the inverse square of the coupling constant:

(44)

\mathbf{L} \;=\; \frac{1}{g^2} \tfrac{1}{2} k_{\alpha \beta} \tilde f^\alpha_{\mu \nu} \tilde f^{\beta \mu \nu} \, dvol_\Sigma \,.

Example

(local Lagrangian density for free B-field)

Consider the field bundle $\wedge^2_\Sigma T^\ast \Sigma \to \Sigma$ for the B-field on Minkowski spacetime from example . With fiber coordinates taken to be $(b_{\mu \nu})$ with

b_{\mu \nu} = - b_{\nu \mu} \,,

the induced fiber coordinates on the corresponding jet bundle $J^\infty_\Sigma(T^\ast \Sigma)$ (def. ) are $( (x^\mu), (b_{\mu \nu}), (b_{\mu \nu, \mu_1}), (b_{\mu \nu, \mu_1 \mu_2}), \cdots )$ .

Consider then the local Lagrangian density (def. ) given by

(45)

\mathbf{L} \;\coloneqq\; \tfrac{1}{2} h_{\mu_1 \mu_2 \mu_3} h^{\mu_1 \mu_2 \mu_3} \, dvol_\Sigma \;\in\; \Omega^{p+1,0}_\Sigma(\wedge^2_\Sigma T^\ast \Sigma) \,,

where $h_{\mu_1 \mu_2 \mu_3}$ are the components of the universal B-field strength on the jet bundle from example .

Example

(Lagrangian density for free Dirac field on Minkowski spacetime)

For $\Sigma$ Minkowski spacetime of dimension $p + 1 \in \{3,4,6,10\}$ (def. ), consider the field bundle $\Sigma \times S_{odd} \to \Sigma$ for the Dirac field from example . With the two-component spinor field fiber coordinates from remark , the jet bundle has induced fiber coordinates as follows:

\left( \left(\psi^\alpha\right) , \left( \psi^\alpha_{,\mu} \right) , \cdots \right) \;=\; \left( \left( (\chi_a), (\chi_{a,\mu}), \cdots \right), \left( ( \xi^{\dagger \dot a}), (\xi^{\dagger \dot a}_{,\mu}), \cdots \right) \right)

All of these are odd-graded elements (def. ) in a Grassmann algebra (example ), hence anti-commute with each other, in generalization of (28):

(46)

\psi^\alpha_{,\mu_1 \cdots \mu_r} \psi^\beta_{,\mu_1 \cdots \mu_s} \;=\; - \psi^\beta_{,\mu_1 \cdots \mu_s} \psi^\alpha_{,\mu_1 \cdots \mu_r} \,.

The Lagrangian density (def. ) of the massless free Dirac field on Minkowski spacetime is

(47)

\mathbf{L} \;\coloneqq\; \overline{\psi} \, \gamma^\mu \psi_{,\mu}\, dvol_\Sigma \,,

given by the bilinear pairing $\overline{(-)}\Gamma(-)$ from prop. of the field coordinate with its first spacetime derivative and expressed here in two-component spinor field coordinates as in (15), hence with the Dirac conjugate $\overline{\psi}$ (14) on the left.

Specifically in spacetime dimension $p + 1 = 4$ , the Lagrangian function for the massive Dirac field of mass $m \in \mathbb{R}$ is

\begin{aligned} L & \coloneqq \underset{ \text{kinetic term} }{ \underbrace{ i \, \overline{\psi} \, \gamma^\mu \, \psi_{,\mu} } } + \underset{ \text{mass term} }{ \underbrace{ m \overline{\psi} \psi }} \end{aligned}

\mathbf{L}_{\eta,\ell_m} \;\coloneqq\; \ell_m \left( i \overline{\psi} \gamma^\mu \psi_{,\mu} + \left( \tfrac{m c}{\hbar} \right) \overline{\psi} \psi \right) dvol_\Sigma \,.

Remark

(reality of the Lagrangian density of the Dirac field)

The kinetic term of the Lagrangian density for the Dirac field form def. is a sum of two contributions, one for each chiral spinor component in the full Dirac spinor (remark ):

\begin{aligned} i \overline{\psi} \gamma^\mu \psi_{,\mu} & = i \underset{ -(\partial_\mu \xi^a ) \sigma^\mu_{a \dot c} \xi^{\dagger \dot c} + \partial_\mu(\chi^a \sigma^\mu_{a \dot c} \chi^{\dagger \dot c}) }{ \underbrace{ \xi^a \sigma^\mu_{a \dot c} \partial_\mu \xi^{\dagger \dot c} } } + \xi^\dagger_{\dot a} \tilde \sigma^{\mu \dot a c} \partial_\mu \xi_c \\ & = \xi^\dagger \tilde \sigma^\mu \partial_\mu \xi + \chi^\dagger \tilde \sigma^\mu \partial_\mu \chi + \partial_\mu(\xi \sigma^\mu \xi^\dagger) \end{aligned}

Here the computation shown under the brace crucially uses that all these jet coordinates for the Dirac field are anti-commuting, due to their supergeometric nature (46).

Notice that a priori this is a function on the jet bundle with values in $\mathbb{K}$ . But in fact for $\mathbb{K} = \mathbb{C}$ it is real up to a total spacetime derivative:, because

\begin{aligned} \left( i \chi^\dagger \tilde \sigma^\mu \partial_\mu\chi \right)^\dagger & = -i \left( \partial_\mu \chi\right)^\dagger \sigma^\mu \chi \\ & = i \chi^\dagger \sigma^\mu \partial_\mu \chi + i \partial_\mu\left( \chi^\dagger \sigma^\mu \chi \right) \end{aligned}

and similarly for $i \xi^\dagger \tilde \sigma^\mu \partial_\mu\xi$

(e.g. Dermisek I-9)

Example

(Lagrangian density for quantum electrodynamics)

Consider the fiber product of the field bundles for the electromagnetic field (example ) and the Dirac field (example ) over 4-dimensional Minkowski spacetime $\Sigma \coloneqq \mathbb{R}^{3,1}$ (def. ):

E \;\coloneqq\; \underset{ \array{ \text{electromagnetic} \\ \text{field} } }{\underbrace{T^\ast \Sigma}} \times \underset{ \array{ \text{Dirac} \\ \text{field} } }{ \underbrace{ S_{odd} } } \,.

This means that now a field history is a pair $(A,\Psi)$ , with $A$ a field history of the electromagnetic field and $\Psi$ a field history of the Dirac field.

On the resulting jet bundle consider the Lagrangian density

(48)

L_{int} \;\coloneqq\; i g \, \overline{\psi} \gamma^\mu \psi a_\mu

for $g \in \mathbb{R}$ some number, called the coupling constant. This is called the electron-photon interaction.

Then the sum of the Lagrangian densities for

the free electromagnetic field (example );
the free Dirac field (example )
the above electron-photon interaction

\mathbf{L}_{EM} + \mathbf{L}_{Dir} + \mathbf{L}_{int} \;=\; \left( \tfrac{1}{2} f_{\mu \nu} f^{\mu \nu} \;+\; i \, \overline{\psi} \, \gamma^\mu \, \psi_{,\mu} + m \overline{\psi} \psi \;+\; i g \, \overline{\psi} \gamma^\mu \psi a_\mu \right) \, dvol_\Sigma

defines the interacting field theory Lagrangian field theory whose perturbative quantization is called quantum electrodynamics.

In this context the square of the coupling constant

\alpha \coloneqq \frac{g^2}{4 \pi}

is called the fine structure constant.

$\,$

Euler-Lagrange forms and presymplectic currents

The beauty of Lagrangian field theory (def. ) is that a choice of Lagrangian density determines both the equations of motion of the fields as well as a presymplectic structure on the space of solutions to this equation (the “shell”), making it the “covariant phase space” of the theory. All this we discuss below. But in fact all this key structure of the field theory is nothing but the shadow (under “transgression of variational differential forms”, def. below) of the following simple relation in the variational bicomplex:

Proposition

(Euler-Lagrange form and presymplectic current)

Given a Lagrangian density $\mathbf{L} \in \Omega^{p+1,0}_\Sigma(E)$ as in def. , then its de Rham differential $\mathbf{d}\mathbf{L}$ , which by degree reasons equals $\delta \mathbf{L}$ , has a unique decomposition as a sum of two terms

(49)

\mathbf{d} \mathbf{L} = \delta_{EL} \mathbf{L} - d \Theta_{BFV}

such that $\delta_{EL}\mathbf{L}$ is proportional to the variational derivative of the fields (but not their derivatives, called a “source form”):

\delta_{EL} \mathbf{L} \;\in\; \Omega^{p+1,0}_{\Sigma}(E) \wedge \delta C^\infty(E) \;\subset\; \Omega^{p+1,1}_{\Sigma}(E) \,.

The map

\delta_{EL} \;\colon\; \Omega^{p+1,0}_{\Sigma}(E) \longrightarrow \Omega^{p+1,0}_{\Sigma}(E) \wedge \delta \Omega^{0,0}_{\Sigma}(E)

thus defined is called the Euler-Lagrange operator and is explicitly given by the Euler-Lagrange derivative:

(50)

\begin{aligned} \delta_{EL} L \, dvol_\Sigma & \coloneqq \frac{\delta_{EL} L}{\delta \phi^a} \delta \phi^a \wedge dvol_\Sigma \\ & \coloneqq \left( \frac{\partial L}{\partial \phi^a} - \frac{d}{d x^\mu} \frac{\partial L}{\partial \phi^a_{,\mu}} + \frac{d^2}{d x^{\mu_1} d x^{\mu_2}} \frac{\partial L}{\partial \phi^a_{\mu_1, \mu_2}} - \cdots \right) \delta \phi^a \wedge dvol_\Sigma \,. \end{aligned}

The smooth subspace of the jet bundle on which the Euler-Lagrange form vanishes

(51)

\mathcal{E} \;\coloneqq\; \left\{ x \in J^\infty_\Sigma(E) \;\vert\; \delta_{EL}\mathbf{L}(x) = 0 \right\} \;\overset{i_{\mathcal{E}}}{\hookrightarrow}\; J^\infty_\Sigma(E) \,.

is called the shell. The smaller subspace on which also all total spacetime derivatives vanish (the “formally integrable prolongation”) is the prolonged shell

(52)

\mathcal{E}^\infty \;\coloneqq\; \left\{ x \in J^\infty_\Sigma(E) \;\vert\; \left( \frac{d^k}{d x^{\mu_1} \cdots d x^{\mu_k}} \delta_{EL}\mathbf{L} \right)(x) = 0 \right\} \overset{i_{\mathcal{E}^\infty}}{\hookrightarrow} J^\infty_\Sigma(E) \,.

Saying something holds “on-shell” is to mean that it holds after restriction to this subspace. For example a variational differential form $\alpha \in \Omega^{\bullet,\bullet}_\Sigma(E)$ is said to vanish on shell if $\alpha\vert_{\mathcal{E}^\infty} = 0$ .

The remaining term $d \Theta_{BFV}$ in (49) is unique, while the presymplectic potential

(53)

\Theta_{BFV} \in \Omega^{p,1}_{\Sigma}(E)

is not unique.

(For a field bundle which is a trivial vector bundle (example over Minkowski spacetime (def. ), prop. says that $\Theta_{BFV}$ is unique up to addition of total spacetime derivatives $d \kappa$ , for $\kappa \in \Omega^{p-1,1}_\Sigma(E)$ .)

One possible choice for the presymplectic current $\Theta_{BFV}$ is

(54)

\begin{aligned} \Theta_{BFV} & \coloneqq \phantom{+} \frac{\partial L}{\partial \phi^a_{,\mu}} \delta \phi^a \; \wedge \iota_{\partial_\mu} dvol_\Sigma \\ & \phantom{=} + \left( \frac{\partial L}{\partial \phi^a_{,\nu \mu}} \delta \phi^a_{,\nu} - \frac{d}{d x^\nu} \frac{\partial L}{\partial \phi^a_{,\mu \nu}} \delta \phi^a_{,\mu} \right) \wedge \iota_{\partial_\mu} dvol_\Sigma \\ & \phantom{=} + \cdots \,, \end{aligned}

where

\iota_{\partial_{\mu}} dvol_\Sigma \;\coloneqq\; (-1)^{\mu} d x^0 \wedge \cdots d x^{\mu-1} \wedge d x^{\mu+1} \wedge \cdots \wedge d x^p

denotes the contraction (def. ) of the volume form with the vector field $\partial_\mu$ .

The vertical derivative of a chosen presymplectic potential $\Theta_{BFV}$ is called a pre-symplectic current for $\mathbf{L}$ :

(55)

\Omega_{BFV} \;\coloneqq\; \delta \Theta_{BFV} \;\;\; \in \Omega^{p,2}_{\Sigma}(E) \,.

Given a choice of $\Theta_{BFV}$ then the sum

(56)

\mathbf{L} + \Theta_{BFV} \;\in\; \Omega^{p+1,0}_\Sigma(E) \oplus \Omega^{p,1}_\Sigma(E)

is called the corresponding Lepage form. Its de Rham derivative is the sum of the Euler-Lagrange variation and the presymplectic current:

(57)

\mathbf{d}( \mathbf{L} + \Theta_{BFV} ) \;=\; \delta_{EL} \mathbf{L} + \Omega_{BFV} \,.

(Its conceptual nature will be elucidated after the introduction of the local BV-complex in example below.)

Proof

Using $\mathbf{L} = L dvol_\Sigma$ and that $d \mathbf{L} = 0$ by degree reasons (example ), we find

\begin{aligned} \mathbf{d}\mathbf{L} & = \left( \frac{\partial L}{\partial \phi^a} \delta \phi^a + \frac{\partial L}{\partial \phi^a_{,\mu}} \delta \phi^a_{,\mu} + \frac{\partial L}{\partial \phi^a_{,\mu_1 \mu_2}} \delta \phi^a_{,\mu_1 \mu_2} + \cdots \right) \wedge dvol_{\Sigma} \end{aligned} \,.

The idea now is to have $d \Theta_{BFV}$ pick up those terms that would appear as boundary terms under the integral $\int_\Sigma j^\infty_\Sigma(\Phi)^\ast \mathbf{d}L$ if we were to consider integration by parts to remove spacetime derivatives of $\delta \phi^a$ .

We compute, using example , the total horizontal derivative of $\Theta_{BFV}$ from (54) as follows:

\begin{aligned} d \Theta_{BFV} & = \left( d \left( \frac{\partial L}{\partial \phi^a_{,\mu}} \delta \phi^a \right) + d \left( \frac{\partial L}{\partial \phi^a_{,\nu \mu}} \delta \phi^a_{,\nu} - \frac{d}{d x^\nu} \frac{\partial L}{\partial \phi^a_{\mu \nu}} \delta \phi^a \right) + \cdots \right) \wedge \iota_{\partial_\mu} dvol_\Sigma \\ & = \left( \left( \left( d \frac{\partial L}{\partial \phi^a_{,\mu}} \right) \wedge \delta \phi^a - \frac{\partial L}{\partial \phi^a_{,\mu}} \delta d \phi^a \right) + \left( \left(d \frac{\partial L}{\partial \phi^a_{,\nu \mu}}\right) \wedge \delta \phi^a_{,\nu} - \frac{\partial L}{\partial \phi^a_{,\nu \mu}} \delta d \phi^a_{,\nu} - \left( d \frac{d}{d x^\nu} \frac{\partial L}{\partial \phi^a_{,\mu \nu}} \right) \wedge \delta \phi^a + \frac{d}{d x^\nu} \frac{\partial L}{\partial \phi^a_{,\mu \nu}} \delta d \phi^a \right) + \cdots \right) \wedge \iota_{\partial_\mu} dvol_\Sigma \\ & = - \left( \left( \frac{d}{d x^\mu} \frac{\partial L}{\partial \phi^a_{,\mu}} \delta \phi^a + \frac{\partial L}{\partial \phi^a_{,\mu}} \delta \phi^a_{,\mu} \right) + \left( \frac{d}{d x^\mu} \frac{\partial L}{\partial \phi^a_{,\nu \mu}} \delta \phi^a_{,\nu} + \frac{\partial L}{\partial \phi^a_{,\nu \mu}} \delta \phi^a_{,\nu \mu} - \frac{d^2}{ d x^\mu d x^\nu} \frac{\partial L}{\partial \phi^a_{,\mu \nu}} \delta \phi^a - \frac{d}{d x^\nu} \frac{\partial L}{\partial \phi^a_{,\mu \nu}} \delta \phi^a_{,\mu} \right) + \cdots \right) \wedge dvol_\Sigma \,, \end{aligned}

where in the last line we used that

d x^{\mu_1} \wedge \iota_{\partial_{\mu_2}} dvol_\Sigma = \left\{ \array{ dvol_\Sigma &\vert& \text{if}\, \mu_1 = \mu_2 \\ 0 &\vert& \text{otherwise} } \right.

Here the two terms proportional to $\frac{d}{d x^\nu} \frac{\partial L}{\partial \phi^a_{,\mu \nu}} \delta \phi^a_{,\mu}$ cancel out, and we are left with

d \Theta_{BFV} \;=\; - \left( \frac{d}{d x^\mu} \frac{\partial L}{\partial \phi^a_{,\mu}} - \frac{d^2}{ d x^\mu d x^\nu} \frac{\partial L}{\partial \phi^a_{,\mu \nu}} + \cdots \right) \delta \phi^a \wedge dvol_\Sigma - \left( \frac{\partial L}{\partial \phi^a_{,\mu}} \delta \phi^a_{,\mu} + \frac{\partial L}{\partial \phi^a_{,\nu \mu}} \delta \phi^a_{,\nu \mu} + \cdots \right) \wedge dvol_\Sigma

Hence $-d \Theta_{BFV}$ shares with $\mathbf{d} \mathbf{L}$ the terms that are proportional to $\delta \phi^a_{,\mu_1 \cdots \mu_k}$ for $k \geq 1$ , and so the remaining terms are proportional to $\delta \phi^a$ , as claimed:

\mathbf{d}\mathbf{L} + d \Theta_{BFV} = \underset{ = \delta_{EL}\mathbf{L} }{ \underbrace{ \left( \frac{\partial L}{\partial \phi^a} - \frac{d}{d x^\mu}\frac{\partial L}{\partial \phi^a_{,\mu}} + \frac{d^2}{d x^\mu d x^\nu} \frac{\partial L}{\partial \phi^a_{,\mu\nu}} + \cdots \right) \delta \phi^a \wedge dvol_\Sigma }} \,.

The following fact is immediate from prop. , but of central importance, we futher amplify this in remark below:

Proposition

(total spacetime derivative of presymplectic current vanishes on-shell)

Let $(E,\mathbf{L})$ be a Lagrangian field theory (def. ). Then the Euler-Lagrange form $\delta_{EL} \mathbf{L}$ and the presymplectic current (prop. ) are related by

d \Omega_{BFV} = - \delta(\delta_{EL}\mathbf{L}) \,.

In particular this means that restricted to the prolonged shell $\mathcal{E}^\infty \hookrightarrow J^\infty_\Sigma(E)$ (52) the total spacetime derivative of the presymplectic current vanishes:

(58)

d \Omega_{BFV} \vert_{\mathcal{E}^\infty} \;=\; 0 \,.

Proof

By prop. we have

\delta \mathbf{L} = \delta_{EL} \mathbf{L} - d \Theta_{BFV} \,.

The claim follows from applying the variational derivative $\delta$ to both sides, using (37): $\delta^2 = 0$ and $\delta \circ d = - d \circ \delta$ .

Many examples of interest fall into the following two special cases of prop. :

Example

(Euler-Lagrange form for spacetime-independent Lagrangian densities)

Let $(E,\mathbf{L})$ be a Lagrangian field theory (def. ) whose field bundle $E$ is a trivial vector bundle $E \simeq \Sigma \times F$ over Minkowski spacetime $\Sigma$ (example ).

In general the Lagrangian density $\mathbf{L}$ is a function of all the spacetime and field coordinates

\mathbf{L} = L((x^\mu), (\phi^a), (\phi^a_{,\mu}), \cdots) dvol_\Sigma \,.

Consider the special case that $\mathbf{L}$ is spacetime-independent in that the Lagrangian function $L$ is independent of the spacetime coordinate $(x^\mu)$ . Then the same evidently holds for the Euler-Lagrange form $\delta_{EL}\mathbf{L}$ (prop. ). Therefore in this case the shell (52) is itself a trivial bundle over spacetime.

In this situation every point $\varphi$ in the jet fiber defines a constant section of the shell:

(59)

\Sigma \times \{\varphi\} \subset \mathcal{E}^\infty \,.

Example

(canonical momentum)

Consider a Lagrangian field theory $(E, \mathbf{L})$ (def. ) whose Lagrangian density $\mathbf{L}$

does not depend on the spacetime-coordinates (example );
depends on spacetime derivatives of field coordinates (hence on jet bundle coordinates) at most to first order.

Hence if the field bundle $E \overset{fb}{\to} \Sigma$ is a trivial vector bundle over Minkowski spacetime (example ) this means to consider the case that

\mathbf{L} \;=\; L\left( (\phi^a), (\phi^a_{,\mu}) \right) \wedge dvol_\Sigma \,.

Then the presymplectic current (def. ) is (up to possibly a horizontally exact part) of the form

(60)

\Omega_{BFV} \;=\; \delta p_a^\mu \wedge \delta \phi^a \wedge \iota_{\partial_\mu} dvol_\Sigma

where

(61)

p_a^\mu \;\coloneqq\; \frac{\partial L}{ \partial \phi^a_{,\mu}}

denotes the partial derivative of the Lagrangian function with respect to the spacetime-derivatives of the field coordinates.

Here

\begin{aligned} p_a & \coloneqq p_a^0 \\ & = \frac{\partial L}{\partial \phi^a_{,0}} \end{aligned}

is called the canonical momentum corresponding to the “canonical field coordinate” $\phi^a$ .

In the language of multisymplectic geometry the full expression

p_a^\mu \wedge \iota_{\partial_\mu} dvol_\Sigma \;\in\; \Omega^{p,1}_\Sigma(E)

is also called the “canonical multi-momentum”, or similar.

Proof

We compute:

\begin{aligned} \mathbf{d} \mathbf{L} & = \left( \frac{\partial L}{\partial \phi^a} \delta \phi^a + \frac{\partial L}{\partial \phi^a_{,\mu}} \delta \phi^a_{,\mu} \right) \delta \phi^a \wedge dvol_\Sigma \\ & = \left( \frac{\partial L}{\partial \phi^a} - \frac{d}{d x^\mu} \frac{\partial L}{\partial \phi^a_{,\mu}} \right) \wedge dvol_\Sigma - d \underset{ \Theta_{BFV} }{ \underbrace{ \left( \frac{\partial L}{\partial \phi^a_{,\mu}} \delta \phi^a \right) \wedge \iota_{\partial_\mu} dvol_\Sigma } } \end{aligned} \,.

Hence

\begin{aligned} \Omega_{BFV} & \coloneqq \delta \Theta_{BFV} \\ & = \delta \left( \frac{\partial L}{\partial \phi^a_{,\mu}} \delta \phi^a_{,\mu} \wedge \iota_{\partial_\mu} dvol_\Sigma \right) \\ & = \delta \frac{\partial L}{\partial \phi^a_{,\mu}} \wedge \delta \phi^a_{,\mu} \wedge \iota_{\partial_\mu} dvol_\Sigma \\ & = \delta p_a^\mu \wedge \delta \phi^a \wedge \iota_{\partial_\mu} dvol_\Sigma \end{aligned}

Remark

(presymplectic current is local version of (pre-)symplectic form of Hamiltonian mechanics)

In the simple but very common situation of example the presymplectic current (def. ) takes the form (61)

\Omega_{BFV} \;=\; \delta p_a^\mu \wedge \delta \phi^a \wedge \iota_{\partial_\mu} dvol_\Sigma

with $\phi^a$ the field coordinates (“canonical coordinates”) and $p_a^\mu$ the “canonical momentum” (61).

Notice that this is of the schematic form “ $(\delta p_a \wedge \delta q^a) \wedge dvol_{\Sigma_p}$ ”, which is reminiscent of the wedge product of a symplectic form expressed in Darboux coordinates with a volume form for a $p$ -dimensional manifold. Indeed, below in Phase space we discuss that this presymplectic current “transgresses” (def. below) to a presymplectic form of the schematic form “ $d P_a \wedge d Q^a$ ” on the on-shell space of field histories (def. ) by integrating it over a Cauchy surface of dimension $p$ . In good situations this presymplectic form is in fact a symplectic form on the on-shell space of field histories (theorem below).

This shows that the presymplectic current $\Omega_{BFV}$ is the local (i.e. jet level) avatar of the symplectic form that governs the formulation of Hamiltonian mechanics in terms of symplectic geometry.

In fact prop. may be read as saying that the presymplectic current is a conserved current (def. below), only that it takes values not in smooth functions of the field coordinates and jets, but in variational 2-forms on fields. There is a conserved charge associated with every conserved current (prop. below) and the conserved charge associated with the presymplectic current is the (pre-)symplectic form on the phase space of the field theory (def. below).

Example

(Euler-Lagrange form and presymplectic current for free real scalar field)

Consider the Lagrangian field theory of the free real scalar field from example .

Then the Euler-Lagrange form and presymplectic current (prop. ) are

(62)

\delta_{EL}\mathbf{L} \;=\; \left(\eta^{\mu \nu} \phi_{,\mu \nu} - m^2 \right) \delta \phi \wedge dvol_\sigma \;\in\; \Omega^{p+1,1}_{\Sigma}(E) \,.

and

\Omega_{BFV} \;=\; \left(\eta^{\mu \nu} \delta \phi_{,\mu} \wedge \delta \phi \right) \wedge \iota_{\partial_\nu} dvol_{\Sigma} \;\in\; \Omega^{p,2}_{\Sigma}(E) \,,

respectively.

Proof

This is a special case of example , but we spell it out in detail again:

We need to show that Euler-Lagrange operator $\delta_{EL} \colon \Omega^{p+1,0}(\Sigma) \to \Omega^{p+1,1}_S(\Sigma)$ takes the local Lagrangian density for the free scalar field to

\delta_EL L \;=\; \left( \eta^{\mu \nu} \phi_{,\mu \nu} - m^2 \phi \right) \delta \phi \wedge \mathrm{dvol}_\Sigma \,.

First of all, using just the variational derivative (vertical derivative) $\delta$ is a graded derivation, the result of applying it to the local Lagrangian density is

\delta L \;=\; \left( \eta^{\mu \nu} \phi_{,\mu} \delta \phi_{,\nu} - m^2 \phi \delta \phi \right) \wedge \mathrm{dvol}_\Sigma \,.

By definition of the Euler-Lagrange operator, in order to find $\delta_{EL}\mathbf{L}$ and $\Theta_{BFV}$ , we need to exhibit this as the sum of the form $(-) \wedge \delta \phi - d \Theta_{BFV}$ .

The key to find $\Theta_{BFV}$ is to realize $\delta \phi_{,\nu}\wedge \mathrm{dvol}_\Sigma$ as a total spacetime derivative (horizontal derivative). Since $d \phi = \phi_{,\mu} d x^\mu$ this is accomplished by

\delta \phi_{,\nu} \wedge \mathrm{dvol}_\Sigma = \delta d \phi \wedge \iota_{\partial_\nu} \mathrm{dvol}_\Sigma \,,

where on the right we have the contraction (def. ) of the tangent vector field along $x^\nu$ into the volume form.

Hence we may take the presymplectic potential (53) of the free scalar field to be

(63)

\Theta_{BFV} \coloneqq \eta^{\mu \nu} \phi_{,\mu} \delta \phi \wedge \iota_{\partial_\nu} \mathrm{dvol}_\Sigma \,,

because with this we have

d \Theta_{BFV} = \eta^{\mu \nu} \left( \phi_{,\mu \nu} \delta \phi - \eta^{\mu \nu} \phi_{,\mu} \delta \phi_{,\nu} \right) \wedge \mathrm{dvol}_\Sigma \,.

In conclusion this yields the decomposition of the vertical differential of the Lagrangian density

\delta L = \underset{ = \delta_{EL} \mathcal{L} }{ \underbrace{ \left( \eta^{\mu \nu} \phi_{,\mu \nu} - m^2 \phi \right) \delta \phi \wedge \mathrm{dvol}_\Sigma } } - d \Theta_{BFV} \,,

which shows that $\delta_{EL} L$ is as claimed, and that $\Theta_{BFV}$ is a presymplectic potential current (53). Hence the presymplectic current itself is

\begin{aligned} \Omega_{BFV} &\coloneqq \delta \Theta_{BFV} \\ & = \delta \left( \eta^{\mu \nu} \phi_{,\mu} \delta \phi \wedge \iota_{\partial_\nu} \mathrm{dvol}_\Sigma \right) \\ & = \left(\eta^{\mu \nu} \delta \phi_{,\mu} \wedge \delta \phi \right) \wedge \iota_{\partial_\nu} dvol_{\Sigma} \end{aligned} \,.

Example

(Euler-Lagrange form for free electromagnetic field)

Consider the Lagrangian field theory of free electromagnetism from example .

The Euler-Lagrange variational derivative is

(64)

\delta_{EL} \mathbf{L} \;=\; - \frac{d}{d x^\mu} f^{\mu \nu} \delta a_\nu \,.

Hence the shell (51) in this case is

\mathcal{E} = \Sigma \times \left\{ \left( (a_\mu) , (a_{\mu,\mu_1}), (a_{\mu,\mu_1 \mu_2}), \cdots \right) \;\vert\; f^{\mu \nu}{}_{,\mu} = 0 \right\} \;\subset\; J^\infty_\Sigma(T^\ast \Sigma) \,.

Proof

By (50) we have

\begin{aligned} \frac{\delta_{EL} L}{\delta a_\mu} \delta a_\mu & = \left( \underset{ = 0 }{ \underbrace{ \frac{\partial}{\partial a_\mu} \tfrac{1}{2} a_{[\mu,\nu]} a^{[\mu,\nu]} } } - \frac{d}{d x^\rho} \frac{\partial}{\partial a_{\alpha,\rho}} \tfrac{1}{2} a_{[\mu,\nu]} a^{[\mu,\nu]} \right) \delta a_\alpha \\ & = - \tfrac{1}{2} \left( \frac{d}{d x^\rho} \frac{\partial}{\partial a_{\alpha,\rho}} a_{\mu,\nu} a^{[\mu,\nu]} \right) \delta a_\alpha \\ & = - \left( \frac{d}{d x^\rho} a^{[\alpha,\rho]} \right) \delta a_{\alpha} \\ & = - f^{\mu \nu}{}_{,\mu} \delta a_{\nu} \,. \end{aligned}

More generally:

Example

(Euler-Lagrange form for Yang-Mills theory on Minkowski spacetime)

Let $\mathfrak{g}$ be a semisimple Lie algebra and consider the Lagrangian field theory $(E,\mathbf{L})$ of $\mathfrak{g}$ -Yang-Mills theory from example .

Its Euler-Lagrange form (prop. ) is

\begin{aligned} \delta_{EL}\mathbf{L} & = - \left( f^{\mu \nu \alpha}_{,\mu} + \gamma^\alpha{}_{\beta' \gamma} a_\mu^{\beta'} f^{\mu \nu \gamma} \right) k_{\alpha \beta} \,\delta a_\mu^\beta \, dvol_\Sigma \,, \end{aligned}

where

f^\alpha_{\mu \nu} \;\in\; \Omega^{0,0}_\Sigma(E)

is the universal Yang-Mills field strength (31).

Proof

With the explicit form (50) for the Euler-Lagrange derivative we compute as follows:

\begin{aligned} \delta_{EL} \left( \tfrac{1}{2} k_{\alpha \beta} f^\alpha_{\mu\nu} f^{\beta \mu \nu} \right) & = \left( \left( \frac{\partial}{\partial a_{\mu'}^{\alpha'}} \left( a_{\nu,\mu}^\alpha + \tfrac{1}{2} \gamma^{\alpha}{}_{\alpha_2 \alpha_3} a_{\mu}^{\alpha_2} a_\nu^{\alpha_3} \right) \right) k_{\alpha \beta} f^{\beta \mu \nu} - \left( \frac{d}{d x^{\nu'}} \frac{\partial}{\partial a_{\mu',\nu'}^{\alpha'}} \left( a_{\nu,\mu}^\alpha + \tfrac{1}{2} \gamma^{\alpha}{}_{\alpha_2 \alpha_3} a_{\mu}^{\alpha_2} a_\nu^{\alpha_3} \right) \right) k_{\alpha \beta} f^{\beta \mu \nu} \right) \delta a_{\mu'}^{\alpha'} \\ & = \gamma^{\alpha}{}_{\alpha' \alpha_3} a_\nu^{\alpha_3} f^{\beta \mu \nu} k_{\alpha \beta} \delta a_{\mu}^{\alpha'} - \left( \frac{d}{d x^{\mu}} f^{\beta \mu \nu} \right) k_{\alpha \beta} \delta a_{\nu}^{\alpha} \\ &= - \left( f^{\alpha \mu \nu}_{,\mu} + \gamma^\alpha{}_{\beta \gamma} a_\mu^\beta f^{\gamma \mu \nu} \right) k_{\alpha \beta} \delta a_\nu^\beta \end{aligned}

In the last step we used that for a semisimple Lie algebra $\gamma_{\alpha \beta \gamma} \coloneqq k_{\alpha \alpha'} \gamma^{\alpha'}{}_{\beta \gamma}$ is totally skew-symmetric in its indices (this being the coefficients of the Lie algebra cocycle) which is in transgression with the Killing form invariant polynomial $k$ .

Example

(Euler-Lagrange form of free B-field)

Consider the Lagrangian field theory of the free B-field from example .

The Euler-Lagrange variational derivative is

\delta_{EL} \mathbf{L} \;=\; h^{\mu \nu \rho}{}_{,\rho} \delta b_{\mu \nu} \,,

where $h_{\mu_1 \mu_2 \mu_3}$ is the universal B-field strength from example .

Proof

By (50) we have

\begin{aligned} \frac{\delta_{EL} L}{\delta b_{\mu \nu}} \delta b_{\mu \nu} & = \left( \underset{ = 0 }{ \underbrace{ \frac{\partial}{\partial b_{\mu \nu}} \tfrac{1}{2} b_{[\mu_1 \mu_2, \mu_3]} b^{[\mu_1 \mu_2, \mu_3]} } } - \frac{d}{d x^\rho} \frac{\partial}{\partial b_{\mu \nu, \rho}} \tfrac{1}{2} b_{[\mu_1 \mu_2, \mu_3]} b^{[\mu_1 \mu_2, \mu_3]} \right) \delta b_{\mu \nu} \\ & = - \left( \frac{d}{d x^\rho} \frac{\partial}{\partial b_{\mu \nu, \rho}} \tfrac{1}{2} b_{\mu_1 \mu_2, \mu_3} b^{[\mu_1 \mu_2, \mu_3]} \right) \delta b_{\mu \nu} \\ & = - \left( \frac{d}{d x^\rho} b^{[\mu \nu, \rho]} \right) \delta b_{\mu \nu} \\ & = - h^{\mu \nu \rho}{}_{,\rho} \delta b_{\mu \nu} \,. \end{aligned}

Example

(Euler-Lagrange form and presymplectic current of Dirac field)

Consider the Lagrangian field theory of the Dirac field on Minkowski spacetime of dimension $p + 1 \in \{3,4,6,10\}$ (example ).

Then

the Euler-Lagrange variational derivative (def. ) in the case of vanishing mass $m$ is

$\delta_{EL} \mathbf{L} \;=\; 2 i\, \overline{\delta \psi} \,\gamma^\mu\, \psi_{,\mu} \, \wedge dvol_\Sigma$

and in the case that spacetime dimension is $p +1 = 4$ and arbitrary mass $m\in \mathbb{R}$ , it is

$\delta_{EL} \mathbf{L} \;=\; \left( \overline{\delta \psi} \left( i \gamma^\mu \psi_{,\mu} + m \psi \right) + \left( - i \gamma^\mu\overline{\psi_{,\mu}} + m \overline{\psi} \right) (\delta \psi) \right) \, dvol_\Sigma$
its presymplectic current (def. ) is

$\Omega_{BFV} \;=\; \overline{\delta \psi}\,\gamma^\mu \,\delta \psi \, \iota_{\partial_\mu} dvol_\Sigma$

Proof

In any case the canonical momentum of the Dirac field according to example is

\begin{aligned} p^\alpha_\mu & \coloneqq \frac{\partial }{\partial \psi^\alpha_{,\mu}} \left( i \overline {\psi} \, \gamma^\nu \, \psi_{,\nu} + m \overline{\psi} \psi \right) \\ & = \overline{\psi}^\beta (\gamma^\mu)_\beta{}^\alpha \end{aligned}

This yields the presymplectic current as claimed, by example .

Now regarding the Euler-Lagrange form, first consider the massless case in spacetime dimension $p+1 \in \{3,4,6,10\}$ , where

L \;=\; i \overline{\psi} \, \gamma^\mu \, \psi_{,\mu} \,.

Then we compute as follows:

\begin{aligned} \delta_{EL} L & = i \,\overline{\delta \psi} \, \gamma^\mu \, \psi_{,\mu} \underset{ = + i \,\overline{\delta \psi} \, \gamma^\mu \, \psi_{,\mu} }{ \underbrace{ - i \overline{\psi_{,\mu}} \, \gamma^\mu \, \delta \psi } } \\ & = 2 i \, \overline{\delta \psi} \, \gamma^\mu \, \psi_{,\mu} \end{aligned}

Here the first equation is the general formula (50) for the Euler-Lagrange variation, while the identity under the braces combines two facts (as in remark above):

the symmetry (12) of the spinor pairing $\overline{(-)}\gamma^\mu(-)$ (prop. );
the anti-commutativity (46) of the Dirac field and jet coordinates, due to their supergeometric nature (remark ).

Finally in the special case of the massive Dirac field in spacetime dimension $p+1 = 4$ the Lagrangian function is

L \;=\; i \, \overline{\psi} \gamma^\mu \psi_{,\mu} + m \overline{\psi}\psi

where now $\psi_\alpha$ takes values in the complex numbers $\mathbb{C}$ (as opposed to in $\mathbb{R}$ , $\mathbb{H}$ or $\mathbb{O}$ ). Therefore we may now form the derivative equivalently by treeating $\psi$ and $\overline{\psi}$ as independent components of the field. This immediately yields the claim.

Example

(trivial Lagrangian densities and the Euler-Lagrange complex)

If a Lagrangian density $\mathbf{L}$ (def. ) is in the image of the total spacetime derivative, hence horizontally exact (def. )

\mathbf{L} \;=\; d \mathbf{\ell}

for any $\mathbf{\ell} \in \Omega^{p,0}_\Sigma(E)$ , then both its Euler-Lagrange form as well as its presymplectic current (def. ) vanish:

\delta_{EL}\mathbf{L} = 0 \phantom{AA} \,, \phantom{AA} \Omega_{BFV} = 0 \,.

This is because with $\delta \circ d = - d \circ \delta$ (37) the defining unique decomposition (49) of $\delta \mathbf{L}$ is given by

\begin{aligned} \delta \mathbf{L} & = \delta d \mathbf{\ell} \\ & = \underset{= \delta_{EL}\mathbf{L}}{\underbrace{0}} - d \underset{\Theta_{BFV}}{\underbrace{\delta \mathbf{l}}} \end{aligned}

which then implies with (55) that

\begin{aligned} \Omega_{BFV} & \coloneqq \delta \Theta_{BFV} \\ & = \delta \delta \mathbf{\ell} \\ & = 0 \end{aligned}

Therefore the Lagrangian densities which are total spacetime derivatives are also called trivial Lagrangian densities.

If the field bundle $E \overset{fb}{\to} \Sigma$ is a trivial vector bundle (example ) over Minkowski spacetime (def. ) then also the converse is true: Every Lagrangian density whose Euler-Lagrange form vanishes is a total spacetime derivative.

Stated more abstractly, this means that the exact sequence of the total spacetime from prop. extends to the right via the Euler-Lagrange variational derivative $\delta_{EL}$ to an exact sequence of the form

\mathbb{R} \overset{}{\hookrightarrow} \Omega^{0,0}_\Sigma(E) \overset{d}{\longrightarrow} \Omega^{1,0}_\Sigma(E) \overset{d}{\longrightarrow} \Omega^{2,0}_\Sigma(E) \overset{d}{\longrightarrow} \cdots \overset{d}{\longrightarrow} \Omega^{p,0}_\Sigma(E) \overset{d}{\longrightarrow} \Omega^{p+1,0}_\Sigma(E) \overset{\delta_{EL}}{\longrightarrow} \Omega^{p+1,0}_\Sigma(E) \wedge \delta(C^\infty(E)) \overset{\delta_{H}}{\longrightarrow} \cdots \,.

In fact, as shown, this exact sequence keeps going to the right; this is also called the Euler-Lagrange complex.

(Anderson 89, theorem 5.1)

The next differential $\delta_{H}$ after the Euler-Lagrange variational derivative $\delta_{EL}$ is known as the Helmholtz operator. By definition of exact sequence, the Helmholtz operator detects whether a partial differential equation on field histories, induced by a variational differential form $P \in \Omega^{p+1,0}_\Sigma(E) \wedge \delta(C^\infty(E))$ as in (65) comes from varying a Lagrangian density, hence whether it is the equation of motion of a Lagrangian field theory via def. .

This way homological algebra is brought to bear on core questions of field theory. For more on this see the exposition at Higher Structures in Physics.

Remark

(supergeometric nature of Lagrangian density of the Dirac field)

Observe that the Lagrangian density for the Dirac field (def. ) makes sense (only) due to the supergeometric nature of the Dirac field (remark ): If the field jet coordinates $\psi_{,\mu_1 \cdots \mu_k}$ were not anti-commuting (46) then the Dirac’s field Lagrangian density (def. ) would be a total spacetime derivative and hence be trivial according to example .

This is because

d \left( \tfrac{1}{2} \overline{\psi} \,\gamma^\mu\, \psi \, \iota_{\partial_\mu} dvol_\Sigma \right) = \tfrac{1}{2} \overline{\psi_{,\mu}} \,\gamma^\mu\, \psi \, dvol_\Sigma + \underset{ = (-1) \tfrac{1}{2} \overline{\psi_{,\mu}} \,\gamma^\mu\, \psi \, dvol_\Sigma }{ \underbrace{ \tfrac{1}{2}\overline{\psi} \,\gamma^\mu\, \psi_{,\mu} \, dvol_\Sigma }} \,.

Here the identification under the brace uses two facts:

the symmetry (12) of the spinor bilinear pairing $\overline{(-)}\Gamma (-)$ ;
the anti-commutativity (46) of the Dirac field and jet coordinates, due to their supergeometric nature (remark ).

The second fact gives the minus sign under the brace, which makes the total expression vanish, if the Dirac field and jet coordinates indeed are anti-commuting (which, incidentally, means that we found an “off-shell conserved current” for the Dirac field, see example below).

If however the Dirac field and jet coordinates did commute with each other, we would instead have a plus sign under the brace, in which case the total horizontal derivative expression above would equal the massless Dirac field Lagrangian (47), thus rendering it trivial in the sense of example .

The same supergeometric nature of the Dirac field will be necessary for its intended equation of motion, the Dirac equation (example ) to derive from a Lagrangian density; see the proof of example below, and see remark below.

$\,$

Euler-Lagrange equations of motion

The key implication of the Euler-Lagrange form on the jet bundle is that it induces the equation of motion on the space of field histories:

Definition

(Euler-Lagrange equation of motion)

Given a Lagrangian field theory $(E,\mathbf{L})$ (def. then the corresponding Euler-Lagrange equations of motion is the condition on field histories (def. )

\Phi_{(-)} \;\colon\; U \longrightarrow \Gamma_\Sigma(E)

to have a jet prolongation (def. )

j^\infty_\Sigma(\Phi_{(-)}(-) ) \;\colon\; U \times \Sigma \longrightarrow J^\infty_\Sigma(E)

that factors through the shell inclusion $\mathcal{E} \overset{i_{\mathcal{E}}}{\hookrightarrow} J^\infty_\Sigma(E)$ (51) defined by vanishing of the Euler-Lagrange form (prop. )

(65)

j^\infty_\Sigma(\Phi_{(-)}(-)) \;\colon\; U \times \Sigma \longrightarrow \mathcal{E} \overset{i_{\mathcal{E}}}{\hookrightarrow} J^\infty_\Sigma(E) \,.

(This implies that $j^\infty_\Sigma(\Phi_{(-)})$ factors even through the prolonged shell $\mathcal{E}^\infty \overset{i_{\mathcal{E}^\infty}}{\hookrightarrow} J^\infty_\Sigma(E)$ (52).)

In the case that the field bundle is a trivial vector bundle over Minkowski spacetime as in example this is the condition that $\Phi_{(-)}$ satisfies the following differential equation (again using prop. ):

\frac{\delta_{EL} L}{\delta \phi^a} \;\coloneqq\; \left( \frac{\partial L}{\partial \phi^a} - \frac{d}{d x^\mu} \frac{\partial L}{\partial \phi^a_{,\mu}} + \frac{d^2}{d x^\mu d x^\nu} \frac{\partial L}{\partial \phi^a_{,\mu\nu}} - \cdots \right) \left( (x^\mu), (\Phi^a), \left( \frac{\partial \Phi^a_{(-)}}{\partial x^\mu}\right), \left( \frac{\partial^2 \Phi^a_{(-)}}{\partial x^\mu \partial x^\nu} \right), \cdots \right) \;=\; 0 \,,

where the differential operator (def. )

(66)

j^\infty_\Sigma(-)^\ast \left( \frac{\delta_{EL}L}{\delta \phi^{(-)}} \right) \;\colon\; \Gamma_\Sigma(E) \longrightarrow \Gamma_\Sigma(T^\ast_\Sigma E)

from the field bundle (def. ) to its vertical cotangent bundle (def. ) is given by the Euler-Lagrange derivative (50).

The on-shell space of field histories is the space of solutions to this condition, namely the the sub-super smooth set (def. ) of the full space of field histories (22) (def. )

(67)

\Gamma_\Sigma(E)_{\delta_{EL} L = 0} \overset{\phantom{AAA}}{\hookrightarrow} \Gamma_\Sigma(E)

whose plots are those $\Phi_{(-)} \colon U \to \Gamma_\Sigma(E)$ that factor through the shell (65).

More generally for $\Sigma_r \hookrightarrow \Sigma$ a submanifold of spacetime, we write

(68)

\Gamma_{\Sigma_r}(E)_{\delta_{EL} L = 0} \overset{\phantom{AAA}}{\hookrightarrow} \Gamma_{\Sigma_r}(E)

for the sub-super smooth ste of on-shell field histories restricted to the infinitesimal neighbourhood of $\Sigma_r$ in $\Sigma$ (25).

Definition

(free field theory)

A Lagrangian field theory $(E, \mathbf{L})$ (def. ) with field bundle $E \overset{fb}{\to} \Sigma$ a vector bundle (e.g. a trivial vector bundle as in example ) is called a free field theory if its Euler-Lagrange equations of motion (def. ) is a differential equation that is linear differential equation, in that with

\Phi_1, \Phi_2 \;\in\; \Gamma_\Sigma(E)_{\delta_{EL}\mathbf{L} = 0}

any two on-shell field histories (67) and $c_1, c_2 \in \mathbb{R}$ any two real numbers, also the linear combination

c_1 \Phi_1 + c_2 \Phi_2 \;\in\; \Gamma_\Sigma(E) \,,

which a priori exists only as an element in the off-shell space of field histories, is again a solution to the equations of motion and hence an element of $\Gamma_\Sigma(E)_{\delta_{EL}\mathbf{L} = 0}$ .

A Lagrangian field theory which is not a free field theory is called an interacting field theory.

Remark

(relevance of free field theory)

In perturbative quantum field theory one considers interacting field theories in the infinitesimal neighbourhood (example ) of free field theories (def. ) inside some super smooth set of general Lagrangian field theories. While free field theories are typically of limited interest in themselves, this perturbation theory around them exhausts much of what is known about quantum field theory in general, and therefore free field theories are of paramount importance for the general theory.

We discuss the covariant phase space of free field theories below in Propagators and their quantization below in Free quantum fields.

Example

(equation of motion of free real scalar field is Klein-Gordon equation)

Consider the Lagrangian field theory of the free real scalar field from example .

By example its Euler-Lagrange form is

\delta_{EL}\mathbf{L} \;=\; \left(\eta^{\mu \nu} \phi_{,\mu \nu} - m^2 \right) \delta \phi \wedge dvol_\sigma

Hence for $\Phi \in \Gamma_\Sigma(E) = C^\infty(X)$ a field history, its Euler-Lagrange equation of motion according to def. is

\eta^{\mu \nu} \frac{\partial^2 }{\partial x^\mu \partial x^\nu} \Phi - m^2 \Phi \;=\; 0

often abbreviated as

(69)

(\Box - m^2) \Phi \;=\; 0 \,.

This PDE is called the Klein-Gordon equation on Minowski spacetime. If the mass $m$ vanishes, $m = 0$ , then this is the relativistic wave equation.

Hence this is indeed a free field theory according to def. .

The corresponding linear differential operator (def. )

(70)

(\Box - m^2) \;\colon\; \Gamma_\Sigma(\Sigma \times \mathbb{R}) \longrightarrow \Gamma_\Sigma(\Sigma \times \mathbb{R})

is called the Klein-Gordon operator.

For later use we record the following basic fact about the Klein-Gordon equation:

Example

(Klein-Gordon operator is formally self-adjoint )

The Klein-Gordon operator (70) is its own formal adjoint (def. ) witnessed by the bilinear differential operator (33) given by

(71)

K(\Phi_1, \Phi_2) \;\coloneqq\; \left( \frac{\partial \Phi_1}{\partial x^\mu} \Phi_2 - \Phi_1 \frac{\partial \Phi_2}{\partial x^\mu} \right) \eta^{\mu \nu}\iota_{\partial_\nu} dvol_\Sigma \,.

Proof

\begin{aligned} d K(\Phi_1, \Phi_2) & = d \left( \frac{\partial \Phi_1}{\partial x^\mu} \Phi_2 - \Phi_1 \frac{\partial \Phi_2}{\partial x^\mu} \right) \eta^{\mu \nu}\iota_{\partial_\nu} dvol_\Sigma \\ &= \left( \left( \eta^{\mu \nu}\frac{\partial^2 \Phi_1}{\partial x^\mu \partial x^\nu} \Phi_2 + \eta^{\mu \nu} \frac{\partial \Phi_1}{\partial x^\mu} \frac{\partial \Phi_2}{\partial x^\nu} \right) - \left( \eta^{\mu \nu} \frac{\partial \Phi_1}{\partial x^\nu} \frac{\partial \Phi_2}{\partial x^\mu} + \Phi_1 \eta^{\mu \nu} \frac{\partial^2 \Phi_2}{\partial x^\nu \partial x^\mu} \right) \right) dvol_\Sigma \\ & = \left( \eta^{\mu \nu}\frac{\partial^2 \Phi_1}{\partial x^\mu \partial x^\nu} \Phi_2 - \Phi_1 \eta^{\mu \nu} \frac{\partial^2 \Phi_2}{\partial x^\nu \partial x^\mu} \right) dvol_\Sigma \\ & = \Box(\Phi_1) \Phi_2 - \Phi_1 \Box (\Phi_2) \end{aligned}

Example

(equations of motion of vacuum electromagnetism are vacuum Maxwell's equations)

Consider the Lagrangian field theory of free electromagnetism on Minkowski spacetime from example .

By example its Euler-Lagrange form is

\delta_{EL}\mathbf{L} \;=\; \frac{d}{d x^\mu}f^{\mu \nu} \delta a_\nu \,.

Hence for $A \in \Gamma_{\Sigma}(T^\ast \Sigma) = \Omega^1(\Sigma)$ a field history (“vector potential”), its Euler-Lagrange equation of motion according to def. is

\begin{aligned} & \frac{\partial}{\partial x^\mu} F^{\mu \nu} = 0 \\ \Leftrightarrow\;\; & d \star_\eta F = 0 \end{aligned} \,,

where $F = d A$ is the Faraday tensor (20). (In the coordinate-free formulation in the second line “ $\star_\eta$ ” denotes the Hodge star operator induced by the pseudo-Riemannian metric $\eta$ on Minkowski spacetime.)

These PDEs are called the vacuum Maxwell's equations.

This, too, is a free field theory according to def. .

Example

(equation of motion of Dirac field is Dirac equation)

Consider the Lagrangian field theory of the Dirac field on Minkowski spacetime from example , with field fiber the spin representation $S$ regarded as a superpoint $S_{odd}$ and Lagrangian density given by the spinor bilinear pairing

L \;=\; i \overline{\psi} \gamma^\mu \partial_\mu \psi + m \overline{\psi}\psi

(in spacetime dimension $p+1 \in \{3,4,6,10\}$ with $m = 0$ unless $p+1 = 4$ ).

By example the Euler-Lagrange differential operator (66) for the Dirac field is of the form

(72)

\array{ \Gamma_\Sigma(\Sigma \times S) &\overset{ }{\longrightarrow}& \Gamma_\Sigma(\Sigma \times S^\ast) \\ \Psi &\mapsto& \overline{(-)} D \psi }

so that the corresponding Euler-Lagrange equation of motion (def. ) is equivalently

(73)

\underset{D}{ \underbrace{ \left(-i \gamma^\mu \partial_\mu + m\right) }} \psi \;=\; 0 \,.

This is the Dirac equation and $D$ is called a Dirac operator. In terms of the Feynman slash notation from (16) the corresponding differential operator, the Dirac operator reads

D \;=\; \left( - i \partial\!\!\!/\, + m \right) \,.

Hence this is a free field theory according to def. .

Observe that the “square” of the Dirac operator is the Klein-Gordon operator $\Box - m^2$ (69)

\begin{aligned} \left( +i \gamma^\mu \partial_\mu + m \right) \left(-i \gamma^\mu \partial_\mu + m\right)\psi & = \left(\partial_\mu \partial^\mu - m^2\right) \psi \\ & = \left(\Box - m^2\right) \psi \end{aligned} \,.

This means that a Dirac field which solves the Dirac equations is in particular (on Minkowski spacetime) componentwise a solution to the Klein-Gordon equation.

Remark

(supergeometric nature of the Dirac equation as an Euler-Lagrange equation)

While the Dirac equation (73) of example would make sense in itself also if the field coordinates $\psi$ and jet coordinates $\psi_{,\mu}$ of the Dirac field were not anti-commuting (46), due to their supergeometric nature (remark ), it would, by remark , then no longer be the Euler-Lagrange equation of a Lagrangian density, hence then Dirac field theory would not be a Lagrangian field theory.

Example

(Dirac operator on Dirac spinors is formally self-adjoint differential operator)

The Dirac operator, hence the differential operator corresponding to the Dirac equation of example via def. is a formally anti-self adjoint (def. ):

D^\ast = - D \,.

Proof

By (72) we are to regard the Dirac operator as taking values in the dual spin bundle by using the Dirac conjugate $\overline{(-)}$ (14):

\array{ \Gamma_\Sigma(\Sigma \times S) &\overset{}{\longrightarrow}& \Gamma_\Sigma(\Sigma \times S^\ast) \\ \Psi &\mapsto& \overline{(-)} D \Psi }

Then we need to show that there is $K(-,-)$ such that for all pairs of spinor sections $\Psi_1, \Psi_2$ we have

\overline{\Psi_2}\gamma^\mu (\partial_\mu \Psi_1) - \overline{\Psi_1}\gamma^\mu (-\partial_\mu \Psi_2) \;=\; d K(\psi_1, \psi_2) \,.

But the spinor-to-vector pairing is symmetric (12), hence this is equivalent to

\overline{\partial_\mu \Psi_1}\gamma^\mu \Psi_2 + \overline{\Psi_1}\gamma^\mu (\partial_\mu \Psi_2) \;=\; d K(\psi_1, \psi_2) \,.

By the product law of differentiation, this is solved, for all $\Psi_1, \Psi_2$ , by

K(\Psi_1, \Psi_2) \;\coloneqq\; \left( \overline{\Psi_1} \gamma^\mu \Psi_2\right) \, \iota_{\partial_\mu} dvol \,.

$\,$

This concludes our discussion of Lagrangian densities and their variational calculus. In the next chapter we consider the infinitesimal symmetries of Lagrangians and the conserved currents that these induce via Noether's theorem.

Symmetries

In this chapter we discuss these topics:

Infinitesimal symmetries of the Lagrangian density
Infinitesimal symmetries of the presymplectic potential current

$\,$

We have introduced the concept of Lagrangian field theories $(E,\mathbf{L})$ in terms of a field bundle $E$ equipped with a Lagrangian density $\mathbf{L}$ on its jet bundle (def. ). Generally, given any object equipped with some structure, it is of paramount interest to determine the symmetries, hence the isomorphisms/equivalences of the object that preserve the given structure (this is the “Erlanger program”, Klein 1872).

The infinitesimal symmetries of the Lagrangian density (def. below) send one field history to an infinitesimally nearby one which is “equivalent” for all purposes of field theory. Among these are the infinitesimal gauge symmetries which will be of concern below. A central theorem of variational calculus says that infinitesimal symmetries of the Lagrangian correspond to conserved currents, this is Noether's theorem I, prop. below. These conserved currents constitute an extension of the Lie algebra of symmetries, called the Dickey bracket.

But in (57) we have seen that the Lagrangian density of a Lagrangian field theory is just one component, in codimension 0, of an inhomogeneous “Lepage form” which in codimension 1 is given by the presymplectic potential current $\Theta_{BFV}$ (53). (This will be conceptually elucidated, after we have introduced the local BV-complex, in example below.) This means that in codimension 1 we are to consider infinitesimal on-shell symmetries of the Lepage form $\mathbf{L} + \Theta_{BFV}$ . These are known as Hamiltonian vector fields (def. below) and the analog of Noether's theorem I now says that these correspond to Hamiltonian differential forms. The Lie algebra of these infinitesimal symmetries is called the local Poisson bracket (prop. below).

Noether theorem and Hamiltonian Noether theorem

$\,$ variational form $\,$	$\,$ symmetry $\,$	$\,$ homotopy formula $\,$	$\,$ physical quantity $\,\,\,$	$\,$ local symmetry algebra $\,$
Lagrangian density $\mathbf{L}$ (def. )	$\mathcal{L}_v \mathbf{L} = d \tilde J$	$d(\underset{= J_v}{\underbrace{\tilde J - \iota_v \Theta_{BFV}}}) = \iota_v \, \delta_{EL}\mathbf{L}$	conserved current $J_v$ (def. )	Dickey bracket
presymplectic current $\Omega_{BFV}$ (prop. )	$\mathcal{L}^{var}_v \Theta_{BFV} = \delta \tilde H$	$\delta(\underset{= H_v}{\underbrace{\tilde H_v - \iota_v \Theta_{BFV}}}) = \iota_v \Omega_{BFV}$	Hamiltonian form $H_v$ (def. )	local Poisson bracket (prop. )

$\,$

In the chapter Phase space below we transgress this local Poisson bracket of infinitesimal symmetries of the presymplectic potential current to the “global” Poisson bracket on the covariant phase space (def. below). This is the structure which then further below leads over to the quantization (deformation quantization) of the prequantum field theory to a genuine perturbative quantum field theory. However, it will turn out that there may be an obstruction to this construction, namely the existence of special infinitesimal symmetries of the Lagrangian densities, called implicit gauge symmetries (discussed further below).

$\,$

infinitesimal symmetries of the Lagrangian density

Definition

(variation)

Let $E \overset{fb}{\to} \Sigma$ be a field bundle (def. ).

A variation is a vertical vector field $v$ on the jet bundle $J^\infty_\Sigma(E)$ (def. ) hence a vector field which vanishes when evaluated in the horizontal differential forms.

In the special case that the field bundle is trivial vector bundle over Minkowski spacetime as in example , a variation is of the form

v = v^a \partial_{\phi^a} + v^a_{,\mu} \partial_{\phi^a_{,\mu}} + v^a_{\mu_1 \mu_2} \partial_{\phi^a_{\mu_1 \mu_2}} + \cdots

The concept of variation in def. is very general, in that it allows to vary the field coordinates independently from the corresponding jets. This generality is necessary for discussion of symmetries of presymplectic currents in def. below. But for discussion of symmetries of Lagrangian densities we are interested in explicitly varying just the field coordinates (def. below) and inducing from this the corresponding variations of the field derivatives (prop. ) below.

In order to motivate the following definition of evolutionary vector fields we follow remark saying that concepts in variational calculus are obtained from their analogous concepts in plain differential calculus by replacing plain bundle morphisms by morphisms out of the jet bundle:

Given a fiber bundle $E \overset{fb}{\to} \Sigma$ , then a vertical vector field on $E$ is a section of its vertical tangent bundle $T_\Sigma E$ (def. ), hence is a bundle morphism of this form

\array{ E && \overset{\text{vertical vector field}}{\longrightarrow} && T_\Sigma E \\ & {}_{\mathllap{id}}\searrow && \swarrow \\ && E }

The variational version replaces the vector bundle on the left with its jet bundle:

Definition

(evolutionary vector fields)

Let $E \overset{fb}{\to} \Sigma$ be a field bundle (def. ). Then an evolutionary vector field $v$ on $E$ is “variational vertical vector field” on $E$ , hence a smooth bundle homomorphism out of the jet bundle (def. )

\array{ J^\infty_\Sigma E && \overset{v}{\longrightarrow} && T_\Sigma E \\ & {}_{\mathllap{jb_{\infty,0}}}\searrow && \swarrow_{\mathllap{}} \\ && E }

to the vertical tangent bundle $T_\Sigma E \overset{}{\to} \Sigma$ (def. ) of $E \overset{fb}{\to} \Sigma$ .

In the special case that the field bundle is a trivial vector bundle over Minkowski spacetime as in example , this means that an evolutionary vector field is a tangent vector field (example ) on $J^\infty_\Sigma(E)$ of the special form

\begin{aligned} v & = v^a \partial_{\phi^a} \\ & = v^a\left( (x^\mu), (\phi^a), (\phi^a_{,\mu}), \cdots \right) \partial_{\phi^a} \end{aligned} \,,

where the coefficients $v^a \in C^\infty(J^\infty_\Sigma(E))$ are general smooth functions on the jet bundle (while the cmponents are tangent vectors along the field coordinates $(\phi^a)$ , but not along the spacetime coordinates $(x^\mu)$ and not along the jet coordinates $\phi^a_{,\mu_1 \cdots \mu_k}$ ).

We write

\Gamma_E^{ev}\left( T_\Sigma E \right) \;\in\; \Omega^{0,0}_\Sigma(E) Mod

for the space of evolutionary vector fields, regarded as a module over the $\mathbb{R}$ -algebra

\Omega^{0,0}_\Sigma(E) \;=\; C^\infty\left( J^\infty_\Sigma(E) \right)

of smooth functions on the jet bundle.

An evolutionary vector field (def. ) describes an infinitesimal change of field values depending on, possibly, the point in spacetime and the values of the field and all its derivatives (locally to finite order, by prop. ).

This induces a corresponding infinitesimal change of the derivatives of the fields, called the prolongation of the evolutionary vector field:

Proposition

(prolongation of evolutionary vector field)

Let $E \overset{fb}{\to} \Sigma$ be a fiber bundle.

Given an evolutionary vector field $v$ on $E$ (def. ) there is a unique tangent vector field $\hat v$ (example ) on the jet bundle $J^\infty_\Sigma(E)$ (def. ) such that

$\hat v$ agrees on field coordinates (as opposed to jet coordinates) with $v$ :

$(jb_{\infty,0})_\ast(\hat v) = v \,,$

which means in the special case that $E \overset{fb}{\to} \Sigma$ is a trivial vector bundle over Minkowski spacetime (example ) that $\hat v$ is of the form

(74) $\hat v \;=\; \underset{ = v }{ \underbrace{ v^a \partial_{\phi^a} }} \,+\, \hat v^a_{\mu} \partial_{\phi^a_{,\mu}} + \hat v^a_{\mu_1 \mu_2} \partial_{\phi^a_{,\mu_1 \mu_2}} + \cdots$
contraction with $\hat v$ (def. ) anti-commutes with the total spacetime derivative (def. ):

(75) $\iota_{\hat v} \circ d + d \circ \iota_{\hat v} = 0 \,.$

In particular Cartan's homotopy formula (prop. ) for the Lie derivative $\mathcal{L}_{\hat v}$ holds with respect to the variational derivative $\delta$ :

(76)

\mathcal{L}_{\hat v} = \delta \circ \iota_{\hat v} + \iota_{\hat v} \circ \delta

Explicitly, in the special case that the field bundle is a trivial vector bundle over Minkowski spacetime (example ) $\hat v$ is given by

(77)

\hat v = \underoverset{n = 0}{\infty}{\sum} \frac{d^n v^a}{ d x^{\mu_1} \cdots d x^{\mu_n} } \partial_{\phi^a_{\mu_1 \cdots \mu_n}} \,.

Proof

It is sufficient to prove the coordinate version of the statement. We prove this by induction over the maximal jet order $k$ . Notice that the coefficient of $\partial_{\phi^a_{\mu_1 \cdots \mu_k}}$ in $\hat v$ is given by the contraction $\iota_{\hat v} \delta \phi^a_{\mu_1 \cdots \mu_k}$ (def. ).

Similarly (at “ $k = -1$ ”) the component of $\partial_{\mu_1}$ is given by $\iota_{\hat v} d x^{\mu}$ . But by the second condition above this vanishes:

\begin{aligned} \iota_{\hat v} d x^\mu & = d \iota_{\hat v} x^\mu \\ & = 0 \end{aligned}

Moreover, the coefficient of $\partial_{\phi^a}$ in $\hat v$ is fixed by the first condition above to be

\iota_{\hat v} \delta \phi^a = v^a \,.

This shows the statement for $k = 0$ . Now assume that the statement is true up to some $k \in \mathbb{N}$ . Observe that the coefficients of all $\partial_{\phi^a_{\mu_1 \cdots \mu_{k+1}}}$ are fixed by the contractions with $\delta \phi^a_{\mu_1 \cdots \mu_{k} \mu_{k+1}} \wedge d x^{\mu_{k+1}}$ . For this we find again from the second condition and using $\delta \circ d + d \circ \delta = 0$ as well as the induction assumption that

\begin{aligned} \iota_{\hat v} \delta \phi^a_{\mu_1 \cdots \mu_{k+1}} \wedge d x^{\mu_{k+1}} & = \iota_{\hat v} \delta d \phi^a_{\mu_1 \cdots \mu_k} \\ & = d \iota_{\hat v} \delta \phi^a_{\mu_1 \cdots \mu_k} \\ & = d \frac{d^k v^a}{d x^{\mu_1} \cdots d x^{\mu_k}} \\ & = \frac{d^{k+1}v^a }{d x^{\mu_1} \cdots d x^{\mu_{k+1}}} d x^{\mu_{k+1}} \,. \end{aligned}

This shows that $\hat v$ satisfying the two conditions given exists uniquely.

Finally formula (76) for the Lie derivative follows from the second of the two conditions with Cartan's homotopy formula $\mathcal{L}_{\hat v} = \mathbf{d} \circ \iota_{\hat v} + \iota_{\hat v} \circ \mathbf{d}$ (prop. ) together with $\mathbf{d} = \delta + d$ (35).

Proposition

(evolutionary vector fields form a Lie algebra)

Let $E \overset{fb}{\to} \Sigma$ be a fiber bundle. For any two evolutionary vector fields $v_1$ , $v_2$ on $E$ (def. ) the Lie bracket of tangent vector fields of their prolongations $\hat v_1$ , $\hat v_2$ (def. ) is itself the prolongation $\widehat{[v_1, v_2]}$ of a unique evolutionary vector field $[v_1,v_2]$ .

This defines the structure of a Lie algebra on evolutionary vector fields.

Proof

It is clear that $[\hat v_1, \hat v_2]$ is still vertical, therefore, by prop. , it is sufficient to show that contraction $\iota_{[v_1, v_2]}$ with this vector field (def. ) anti-commutes with the horizontal derivative $d$ , hence that $[d, \iota_{[\hat v_1, \hat v_2]}] = 0$ .

Now $[d, \iota_{[\hat v_1, \hat v_2]}]$ is an operator that sends vertical 1-forms to horizontal 1-forms and vanishes on horizontal 1-forms. Therefore it is sufficient to see that this operator in fact also vanishes on all vertical 1-forms. But for this it is sufficient that it commutes with the vertical derivative. This we check by Cartan calculus, using $[d,\delta] = 0$ and $[d, \iota_{\hat v_i}]=0$ , by assumption:

\begin{aligned} {[ \delta, [ d,\iota_{[\hat v_1, \hat v_2]}] ]} & = - [d, [\delta, \iota_{[\hat v_1, \hat v_2]}]] \\ & = - [d, \mathcal{L}_{[\hat v_1, \hat v_2]}] \\ & = -[d, [\mathcal{L}_{\hat v_1}, \iota_{\hat v_2}] ] \\ & = - [d, [ [\delta, \iota_{\hat v_1}], \iota_{\hat v_2} ]] \\ & = 0 \,. \end{aligned}

Now given an evolutionary vector field, we want to consider the flow that it induces on the space of field histories:

Definition

(flow of field histories along evolutionary vector field)

Let $E \overset{fb}{\to} \Sigma$ be a field bundle (def. ) and let $v$ be an evolutionary vector field (def. ) such that the ordinary flow of its prolongation $\hat v$ (prop. )

\exp(t \hat v) \;\colon\; J^\infty_\Sigma(E) \longrightarrow J^\infty_\Sigma(E)

exists on the jet bundle (e.g. if the order of derivatives of field coordinates that it depends on is bounded).

For $\Phi_{(-)} \colon U_1 \to \Gamma_\Sigma(E)$ a collection of field histories (hence a plot of the space of field histories (def. ) ) the flow of $v$ through $\Phi_{(-)}$ is the smooth function

U_1 \times \mathbb{R}^1 \overset{\exp(v)(\Phi_{(-)})}{\longrightarrow} \Gamma_\Sigma(E)

whose unique factorization $\widehat{\exp(v)}(\Phi_{(-)})$ through the space of jets of field histories (i.e. the image $im(j^\infty_\Sigma)$ of jet prolongation, def. )

\array{ && im(j^\infty_\Sigma) &\hookrightarrow& \Gamma_\Sigma(J^\infty_\Sigma(E)) \\ & {}^{\mathllap{\widehat{\exp(v)}(\Phi_{(-)})}} \nearrow& \downarrow^{\mathrlap{\simeq}} \\ U_1 \times \mathbb{R}^1 &\underset{ \exp(v)(\Phi) }{\longrightarrow}& \Gamma_{\Sigma}(E)_{} }

takes a plot $t_{(-)} \;\colon\; U_2 \to \mathbb{R}^1$ of the real line (regarded as a super smooth set via example ), to the plot

(78)

(\exp(t(-) \hat v) \circ j^\infty_\Sigma(\Phi_{(-)}) \;\colon\: U_1 \times U_2 \longrightarrow \Gamma_\Sigma\left( J^\infty_\Sigma(E) \right)

of the smooth space of sections of the jet bundle.

(That $\exp(t(-) \hat v)$ indeed flows jet prolongations $j^\infty_\Sigma(\Phi(-))$ again to jet prolongations is due to its defining relation to the evolutionary vector field $v$ from prop. .)

Definition

(infinitesimal symmetries of the Lagrangian and conserved currents)

Let $(E,\mathbf{L})$ be a Lagrangian field theory (def. ).

Then

an infinitesimal symmetry of the Lagrangian is an evolutionary vector field $v$ (def. ) such that the Lie derivative of the Lagrangian density along its prolongation $\hat v$ (prop. ) is a total spacetime derivative:

$\mathcal{L}_{\hat v} \mathbf{L} \;=\; d \tilde J_{\hat v}$
an on-shell conserved current is a horizontal $p$ -form $J \in \Omega^{p,0}_\Sigma(E)$ (def. ) whose total spacetime derivative vanishes on the prolonged shell (51)

$d J\vert_{\mathcal{E}^\infty} \;=\; 0 \,.$

Proposition

(Noether's theorem I)

Let $(E,\mathbf{L})$ be a Lagrangian field theory (def. ).

If $v$ is an infinitesimal symmetry of the Lagrangian (def. ) with $\mathcal{L}_{\hat v} \mathbf{L} = d \tilde J_{\hat v}$ , then

(79)

J_{\hat v} \coloneqq \tilde J_{\hat v} - \iota_{\hat v} \Theta_{BFV}

is an on-shell conserved current (def. ), for $\Theta_{BFV}$ a presymplectic potential (53) from def. .

(Noether's theorem II is prop. below.)

Proof

By Cartan's homotopy formula for the Lie derivative (prop. ) and the decomposition of the variational derivative $\delta \mathbf{L}$ (49) and the fact that contraction $\iota_{\hat v}$ with the prolongtion of an evolutionary vector field vanishes on horizontal differential forms (74) and anti-commutes with the horizontal differential (75), by def. , we may re-express the defining equation for the symmetry as follows:

\begin{aligned} d \tilde J_{\hat v} & = \mathcal{L}_{\hat v} \mathbf{L} \\ & = \iota_{\hat v} \underset{= \delta_{EL}\mathbf{L} - d \Theta_{BFV}}{\underbrace{\mathbf{d} \mathbf{L}}} + \mathbf{d} \underset{= 0}{\underbrace{\iota_v \mathbf{L}}} \\ & = \iota_{\hat v} \delta_{EL} \mathbf{L} + d \iota_{\hat v} \Theta_{BFV} \end{aligned}

which is equivalent to

(80)

d(\underset{= J_{\hat v}}{\underbrace{\tilde J_{\hat v} - \iota_{\hat v} \Theta_{BFV}}}) \;=\; \iota_{\hat v} \delta_{EL}\mathbf{L}

Since, by definition of the shell $\mathcal{E}$ , the differential form on the right vanishes on $\mathcal{E}$ this yields the claim.

Example

(energy-momentum of the scalar field)

Consider the Lagrangian field theory of the free scalar field from def. :

\mathbf{L} \;=\; \tfrac{1}{2} \left( \eta^{\mu \nu}\phi_{,\mu} \phi_{,\nu} - m^2 \phi^2 \right) dvol_\Sigma \,.

For $\nu \in \{0, 1, \cdots, p\}$ consider the vector field on the jet bundle given by

v_\nu \;\coloneqq\; \phi_{,\nu} \partial_{\phi} + \phi_{,\mu \nu} \partial_{\phi_{,\mu}} + \cdots \,.

This describes infinitesimal translations of the fields in the direction of $\partial_\nu$ .

And this is an infinitesimal symmetry of the Lagrangian (def. ), since

\iota_{v_\nu} \mathbf{d}\mathbf{L} = d L \wedge \iota_{\partial_\nu} dvol_\Sigma \,.

With the formula (63) for the presymplectic potential

\Theta_{BFV} = \eta^{\mu \nu} \phi_{,\mu} \delta \phi \iota_{\partial_{\nu}} dvol_\Sigma

it hence follows from Noether's theorem (prop. ) that the corresponding conserved current (def. ) is

\begin{aligned} T_\nu & = L \, \iota_{\partial_\nu} dvol_\Sigma - \iota_{v_\nu}\Theta_{BFV} \\ & = L \, \iota_{\partial_\nu} dvol_\Sigma - \eta^{\rho \mu} \phi_{,\rho} \phi_{,\nu} \, \iota_{\partial_\mu} dvol_\Sigma \\ & = ( \underset{=: T^\mu_\nu}{ \underbrace{ \delta^\mu_\nu L - \eta^{\rho \mu} \phi_{,\rho} \phi_{,\nu} } } ) \, \iota_{\partial_\mu} dvol_\Sigma \end{aligned} \,.

This conserved current is called the energy-momentum tensor.

Example

(Dirac current)

Consider the Lagrangian field theory of the free Dirac field on Minkowski spacetime in spacetime dimension $p + 1 = 3+1$ (example )

\mathbf{L} = i \overline{\psi} \gamma^\mu \psi_{,\mu} \, dvol_\Sigma \,.

Then the prolongation (prop. ) of the evolutionary vector field (def. )

v \;\coloneqq\; i \psi_\alpha \partial_{\psi_\alpha}

is an infinitesimal symmetry of the Lagrangian (def. ). The conserved current that corresponds to this under Noether's theorem I (prop. ) is

i \overline{\psi} \gamma^\mu \psi \, \iota_{\partial_\mu} dvol_\Sigma \;\in\; \Omega^{p,0}_{\Sigma}(E) \,.

This is called the Dirac current.

Proof

By equation (77) the prolongation of $v$ is

\hat v = i \psi_\alpha \partial_{\psi_\alpha} + i \psi_{\alpha,\mu} \partial_{\psi_{\alpha,\mu}} + \cdots \,.

Therefore the Lagrangian density is strictly invariant under the Lie derivative along $\hat v$

\begin{aligned} \mathcal{L}_{\hat v} \left( i \overline{\psi} \gamma^\mu \psi_{,\mu} \right) dvol_\Sigma & = \underset{ = i \cdot (-i) \overline{\psi} \gamma^\mu \psi_{,\mu} }{ \underbrace{ i \overline{i \psi} \gamma^\mu \psi_{,\mu} } } dvol_\Sigma + \underset{ i \cdot i \overline{\psi} \gamma^\mu \psi_{,\mu} }{ \underbrace{ i \overline{\psi} \gamma^\mu (i \psi_{,\mu}) } } dvol_\Sigma \\ & = 0 \,. \end{aligned}

and so the formula for the corresponding conserved current (79) is

\begin{aligned} J_v & = - \iota_{\hat v} \left( \underset{ - \overline{\psi} \gamma^\mu \delta \psi \, \iota_{\partial_\mu} dvol_\Sigma }{ \underbrace{ \Theta_{BFV} } } \right) \\ & = + i \overline{\psi}\gamma^\mu \psi \, \iota_{\partial_\mu} dvol_\Sigma \end{aligned} \,,

where under the brace we used example to identify the presymplectic potential for the free Dirac field.

$\,$

Since an infinitesimal symmetry of a Lagrangian (def. ) by definition changes the Lagrangian only up to a total spacetime derivative, and since the Euler-Lagrange equations of motion by construction depend on the Lagrangian density only up to a total spacetime derivative (prop. ), it is plausible that and infinitesimal symmetry of the Lagrangian preserves the equations of motion (50), hence the shell (52). That this is indeed the case is the statement of prop. below.

To make the proof transparent, we now first introduce the concept of the evolutionary derivative (def. ) below and then observe that in terms of these the Euler-Lagrange derivative is in fact a derivation (prop. ).

Definition

(field-dependent sections)

For

E \overset{fb}{\longrightarrow} \Sigma

a fiber bundle (def. ), regarded as a field bundle (def. ), and for

E' \overset{fb'}{\longrightarrow} \Sigma

any other fiber bundle over the same base space (spacetime), we write

\Gamma_{J^\infty_\Sigma(E)}(E') \;\coloneqq\; \Gamma_{J^\infty_\Sigma(E)}( jb^\ast E' ) \;=\; Hom_\Sigma(J^\infty_\Sigma(E), E') \;\simeq\; DiffOp(E,E')

for the space of sections of the pullback of bundles of $E'$ to the jet bundle $J^\infty_\Sigma(E) \overset{jb}{\longrightarrow} \Sigma$ (def. ) along $jb$ .

\Gamma_{J^\infty_\Sigma(E)}(E') \;=\; \left\{ \array{ && E' \\ & {}^{\mathllap{}}\nearrow & \downarrow \mathrlap{fb'} \\ J^\infty_\Sigma(E) &\underset{jb}{\longrightarrow}& \Sigma } \phantom{A}\,\, \right\} \,.

(Equivalently this is the space of differential operators from sections of $E$ to sections of $E'$ , according to prop. . )

In (Olver 93, section 5.1, p. 288) the field dependent sections of def. , considered in local coordinates, are referred to as tuples of differential functions.

Example

(source forms and evolutionary vector fields are field-dependent sections)

For $E \overset{fb}{\to} \Sigma$ a field bundle, write $T_\Sigma E$ for its vertical tangent bundle (example ) and $T_\Sigma^\ast E$ for its dual vector bundle (def. ), the vertical cotangent bundle.

Then the field-dependent sections of these bundles according to def. are identified as follows:

the space $\Gamma_{J^\infty_\Sigma(E)}(T_\Sigma E)$ contains the space of evolutionary vector fields $v$ (def. ) as those bundle morphism which respect not just the projection to $\Sigma$ but also its factorization through $E$ :

$\left( \array{ && T_\Sigma E \\ & {}^{\mathllap{v}}\nearrow & \downarrow^{\mathrlap{tb_\Sigma}} \\ J^\infty_\Sigma(E) &\underset{jb_{\infty,0}}{\longrightarrow}& E & \underset{fb}{\longrightarrow}& \Sigma } \right) \;\in\; \Gamma_{J^\infty_\Sigma(E)}(T_\Sigma E)$
$\Gamma_{J^\infty_\Sigma(E)}( T^\ast_\Sigma E) \otimes \wedge^{p+1}_\Sigma(T^\ast \Sigma)$ contains the space of source forms $E$ (prop. ) as those bundle morphisms which respect not just the projection to $\Sigma$ but also its factorization through $E$ :

$\left( \array{ && T^\ast_\Sigma E \\ & {}^{E}\nearrow & \downarrow^{\mathrlap{ctb_\Sigma}} \\ J^\infty_\Sigma(E) &\underset{jb_{\infty,0}}{\longrightarrow}& E & \underset{fb}{\longrightarrow}& \Sigma } \right) \;\in\; \Gamma_{J^\infty_\Sigma(E)}(T^\ast_\Sigma E)$

This makes manifest the duality pairing between source forms and evolutionary vector fields

\array{ \Gamma_{J^\infty_\Sigma(E)}(T_\Sigma E) \otimes \Gamma_{J^\infty_\Sigma(E)}(T^\ast_\Sigma E) &\longrightarrow& C^\infty(J^\infty_\Sigma(E)) }

which in local coordinates is given by

(v^a \partial_{\phi^a} \,,\, \omega_a \delta \phi^a) \mapsto v^a \omega_a

for $v^a, \omega_a \in C^\infty(J^\infty_\Sigma(E))$ smooth functions on the jet bundle (as in prop. ).

Definition

(evolutionary derivative of field-dependent section)

Let

E \overset{fb}{\to} \Sigma

be a fiber bundle regarded as a field bundle (def. ) and let

V \overset{vb}{\to} \Sigma

be a vector bundle (def. ). Then for

P \in \Gamma_{J^\infty_\Sigma(E)}(V)

a field-dependent section of $E$ according to def. , its evolutionary derivative is the morphism

\array{ \Gamma_{J^\infty_\Sigma(E)}(T_\Sigma E) & \overset{ \mathrm{D}P }{\longrightarrow} & \Gamma_{J^\infty_\Sigma(E)}(V) \\ v &\mapsto& \hat v(P) }

which, under the identification of example , sense an evolutionary vector field $v$ to the derivative of $P$ (example ) along the prolongation tangent vector field $\hat v$ of $v$ (prop. ).

In the case that $E$ and $V$ are trivial vector bundles over Minkowski spacetime with coordinates $((x^\mu), (\phi^a))$ and $((x^\mu), (\rho^b))$ , respectively (example ), then by (77) this is given by

((\mathrm{D}P)(v))^b \;=\; \left( v^a \frac{\partial P^b}{\partial \phi^a} + \frac{d v^a}{d x^\mu} \frac{\partial P^b}{\partial \phi^a_{,\mu}} + \frac{d^2 v^a}{d x^\mu d x^\nu} \frac{\partial P^b}{\partial \phi^a_{,\mu \nu}} + \cdots \right)

This makes manifest that $\mathrm{D}P$ may equivalently be regarded as a $J^\infty_\Sigma(E)$ -dependent differential operator (def. ) from the vertical tangent bundle $T_\Sigma E$ (def. ) to $V$ , namely a bundle homomorphism over $\Sigma$ of the form

\mathrm{D}_P \;\colon\; J^\infty_\Sigma(E) \times_\Sigma J^\infty_\Sigma T_\Sigma E \longrightarrow V

in that

(81)

\mathrm{D}_P(-,v) = \mathrm{D}P(v) = \hat v (P) \,.

(Olver 93, def. 5.24)

Example

(evolutionary derivative of Lagrangian function)

Over Minkowski spacetime $\Sigma$ (def. ), let $\mathbf{L} = L dvol \in \Omega^{p+1,0}_\Sigma(E)$ be a Lagrangian density (def. ), with coefficient function regarded as a field-dependent section (def. ) of the trivial real line bundle:

L \;\in \; \Gamma_{J^\infty_\Sigma}(\Sigma \times \mathbb{R}) \,,

Then the formally adjoint differential operator (def. )

(\mathrm{D}_L)^\ast \;\colon\; J^\infty_\Sigma(E)\times_\Sigma (\Sigma \times \mathbb{R})^\ast \longrightarrow T_\Sigma^\ast E

of its evolutionary derivative, def. , regarded as a $J^\infty_\Sigma(E)$ -dependent differential operator $\mathrm{D}_P$ from $T_\Sigma$ to $V$ and applied to the constant section

1 \in \Gamma_\Sigma(\Sigma \times \mathbb{R}^\ast)

is the Euler-Lagrange derivative (50)

\delta_{EL}\mathbf{L} \;=\; \left(\mathrm{D}_{L}\right)^\ast(1) \;\in\; \Gamma_{J^\infty_\Sigma(E)}(T_\Sigma^\ast) \simeq \Omega^{p+1,1}_\Sigma(E)_{source}

via the identification from example .

Proposition

(Euler-Lagrange derivative is derivation via evolutionary derivatives)

Let $V \overset{vb}{\to} \Sigma$ be a vector bundle (def. ) and write $V^\ast \overset{}{\to} \Sigma$ for its dual vector bundle (def. ).

For field-dependent sections (def. )

\alpha \in \Gamma_{J^\infty_\Sigma(E)}(V)

and

\beta^\ast \in \Gamma_{J^\infty_\Sigma(E)}(V^\ast)

we have that the Euler-Lagrange derivative (50) of their canonical pairing to a smooth function on the jet bundle (as in prop. ) is the sum of the derivative of either one via the formally adjoint differential operator (def. ) of the evolutionary derivative (def. ) of the other:

\delta_{EL}( \alpha \cdot \beta^\ast ) \;=\; (\mathrm{D}_\alpha)^\ast(\beta^\ast) + (\mathrm{D}_{\beta^\ast})^\ast(\alpha)

Proof

It is sufficient to check this in local coordinates. By the product law for differentiation we have

\begin{aligned} \frac{ \delta_{EL} \left(\alpha \cdot \beta^\ast \right) } { \delta \phi^a } & = \frac{\partial \left(\alpha \cdot \beta^\ast \right)}{\partial \phi^a} - \frac{d}{d x^\mu} \left( \frac{\partial \left( \alpha \cdot \beta^\ast \right)}{\partial \phi^a_{,\mu}} \right) + \frac{d}{d x^\mu d x^\nu} \left( \frac{\partial \left( \alpha \cdot \beta^\ast \right) }{\partial \phi^a_{,\mu \nu}} \right) - \cdots \\ & = \phantom{+} \frac{\partial \alpha }{\partial \phi^a} \cdot \beta^\ast - \frac{d}{d x^\mu} \left( \frac{\partial \alpha }{\partial \phi^a_{,\mu}} \cdot \beta^\ast \right) + \frac{d}{d x^\mu d x^\nu} \left( \frac{\partial \alpha }{\partial \phi^a_{,\mu \nu}} \cdot \beta^\ast \right) - \cdots \\ & \phantom{=} + \frac{\partial \beta^\ast }{\partial \phi^a} \cdot \alpha - \frac{d}{d x^\mu} \left( \frac{\partial \beta^\ast }{\partial \phi^a_{,\mu}} \cdot \alpha \right) + \frac{d}{d x^\mu d x^\nu} \left( \frac{\partial \beta^\ast }{\partial \phi^a_{,\mu \nu}} \cdot \alpha \right) - \cdots \\ & = (\mathrm{D}_\alpha)_a^\ast(\beta^\ast) + (\mathrm{D}_{\beta^\ast})_a^\ast(\alpha) \end{aligned}

Proposition

(evolutionary derivative of Euler-Lagrange forms is formally self-adjoint)

Let $(E,\mathbf{L})$ be a Lagrangian field theory (def. ) over Minkowski spacetime (def. ) and regard the Euler-Lagrange derivative

\delta_{EL}\mathbf{L} \;=\; \delta_{EL}L \wedge dvol_\Sigma

(from prop. ) as a field-dependent section of the vertical cotangent bundle

\delta_{EL}L \;\in\; \Gamma_{J^\infty_\Sigma(E)}(T^\ast_\Sigma E)

as in example . Then the corresponding evolutionary derivative field-dependent differential operator $D_{\delta_{EL}L}$ (def. ) is formally self-adjoint (def. ):

(D_{\delta_{EL}L})^\ast \;=\; D_{\delta_{EL}L} \,.

(In terms of the Euler-Lagrange complex, remark , this says that the Helmholtz operator vanishes on the image of the Euler-Lagrange operator.)

(Olver 93, theorem 5.92) The following proof is due to Igor Khavkine.

Proof

By definition of the Euler-Lagrange form (def. ) we have

\frac{\delta_{EL} L }{\delta \phi^a} \delta \phi^a \, \wedge dvol_\Sigma \;=\; \delta L \,\wedge dvol_\Sigma \;+\; d(...) \,.

Applying the variational derivative $\delta$ (def. ) to both sides of this equation yields

\left(\delta \frac{\delta_{EL} L }{\delta \phi^a}\right) \wedge \delta \phi^a \, \wedge dvol_\Sigma \;=\; \underset{= 0}{\underbrace{\delta \delta L}} \wedge dvol_\Sigma \;+\; d(...) \,.

It follows that for $v,w$ any two evolutionary vector fields the contraction (def. ) of their prolongations $\hat v$ and $\hat w$ (def. ) into the differential 2-form on the left is

\left( \delta \frac{\delta_{EL} L }{\delta \phi^a} \wedge \delta \phi^a \right)(v,w) = w^a (\mathrm{D}_{\delta_{EL}})_a(v) - v^b(\mathrm{D}_{\delta_{EL}})_b(w) \,,

by inspection of the definition of the evolutionary derivative (def. ). Moreover, their contraction into the differential form on the right is

\iota_{\hat v} \iota_{\hat w} d(...) \;=\; d(...)

by the fact (prop. ) that contraction with prolongations of evolutionary vector fields anti-commutes with the total spacetime derivative (75).

Hence the last two equations combined give

w^a (\mathrm{D}_{\delta_{EL}})_a(v) - v^b(\mathrm{D}_{\delta_{EL}})_b(w) \;=\; d(...) \,.

This is the defining condition for $\mathrm{D}_{\delta_{EL}}$ to be formally self-adjoint differential operator (def. ).

$\,$

Now we may finally prove that an infinitesimal symmetry of the Lagrangian is also an infinitesimal symmetry of the Euler-Lagrange equations of motion:

Proposition

(infinitesimal symmetries of the Lagrangian are also infinitesimal symmetries of the equations of motion)

Let $(E,\mathbf{L})$ be a Lagrangian field theory. If an evolutionary vector field $v$ is an infinitesimal symmetry of the Lagrangian then the flow along its prolongation $\hat v$ preserves the prolonged shell $\mathcal{E}^\infty \hookrightarrow J^\infty_\Sigma(E)$ (52) in that the Lie derivative of the Euler-Lagrange form $\delta_{EL}\mathbf{L}$ along $\hat v$ vanishes on $\mathcal{E}^\infty$ :

\mathcal{L}_{\hat v}\mathbf{L} = d(...) \phantom{AAA} \Rightarrow \phantom{AAA} \mathcal{L}_{\hat v} \, \delta_{EL}\mathbf{L}\vert_{\mathcal{E}^\infty} = 0 \,.

Proof

Notice that for any vector field $\hat v$ the Lie derivative (prop. ) $\mathcal{L}_{\hat v}$ of the Euler-Lagrange form $\delta_{EL}\mathbf{L} = \frac{\delta_{EL}L}{\delta \phi^a} \delta \phi^a \wedge dvol_\Sigma$ differs from that of its component functions $\frac{\delta_{EL}L}{\delta \phi^a} dvol_\Sigma$ by a term proportional to these component functions, which by definition vanishes on-shell:

\mathcal{L}_{\hat v} \left( \frac{\delta_{EL} L}{\delta \phi^a} \delta \phi^a \wedge dvol_\Sigma \right) \;=\; \underset{ = \hat v\left( \frac{\delta_{EL}L}{\delta \phi^a} \right) }{ \underbrace{ \left( \mathcal{L}_{\hat v} \frac{\delta_{EL}L}{\delta \phi^a} \right) } } \delta \phi^a \wedge dvol_\Sigma + \underset{ = 0 \, \text{on} \, \mathcal{E}^\infty }{ \underbrace{ \frac{\delta_{EL}L}{\delta \phi^a} } } \left( \mathcal{L}_{\hat v} \delta \phi^a \right) \wedge dvol_\Sigma

But the Lie derivative of the component functions is just their plain derivative. Therefore it is sufficient to show that

\hat v \left( \frac{\delta_{EL} L}{\delta \phi^a} \right) \vert_{\mathcal{E}^\infty} \;=\; 0 \,.

Now by Noether's theorem I (prop. ) the condition $\mathcal{L}_{\hat v} = d \tilde J_{\hat v}$ for an infinitesimal symmetry of the Lagrangian implies that the contraction (def. ) of the Euler-Lagrange form with the corresponding evolutionary vector field is a total spacetime derivative:

\iota_{\hat v} \, \delta_{EL}\mathbf{L} \;=\; d J_{\hat v} \,.

Since the Euler-Lagrange derivative vanishes on total spacetime derivative (example ) also its application on the contraction on the left vanishes. But via example that contraction is a pairing of field-dependent sections as in prop. . Hence we use this proposition to compute:

(82)

\begin{aligned} 0 & = \frac{\delta_{EL} \left( v \cdot \delta_{EL} L\right) }{ \delta \phi^a } \\ & = (\mathrm{D}_{v})^\ast_a( \delta_{EL}L ) + (\mathrm{D}_{\delta_{EL}L})^\ast_a(v) \\ & = (\mathrm{D}_{v})^\ast_a( \delta_{EL}L ) + (\mathrm{D}_{\delta_{EL}L})_a(v) \\ & = (\mathrm{D}_{v})^\ast_a( \delta_{EL}L ) + \hat v\left( \frac{\delta_{EL}L}{\delta \phi^a} \right) \,. \end{aligned}

Here the first step is by prop. , the second step is by prop. and the third step is (81).

Hence

\begin{aligned} \hat v(\delta_{EL}L) \vert_{\mathcal{E}^\infty} & = - (\mathrm{D}_{v})^\ast( \delta_{EL}L ) \vert_{\mathcal{E}^\infty} \\ & = 0 \end{aligned} \,,

where in the last line we used that on the prolonged shell $\delta_{EL}L$ and all its horizontal derivatives vanish, by definition.

As a corollary we obtain:

Proposition

(flow along infinitesimal symmetry of the Lagrangian preserves on-shell space of field histories)

Let $(E,\mathbf{L})$ be a Lagrangian field theory (def. ).

For $v$ an infinitesimal symmetry of the Lagrangian (def. ) the flow on the space of field histories (example ) that it induces by def. preserves the space of on-shell field histories (from prop. ):

\array{ \Gamma_\Sigma(E)_{\delta_{EL}\mathbf{L} = 0} &\hookrightarrow& \Gamma_\Sigma(E) \\ {\mathllap{\exp(\hat v)\vert_{\delta_{EL}\mathbf{L} = 0} }} \uparrow && \uparrow {\mathrlap{\exp(\hat v)}} \\ \Gamma_\Sigma(E)_{\delta_{EL}\mathbf{L} = 0} &\hookrightarrow& \Gamma_\Sigma(E) }

Proof

By def. a field history $\Phi \in \Gamma_\Sigma(E)$ is on-shell precisely if its jet prolongation $j^\infty_\Sigma(E)$ (def. ) factors through the shell $\mathcal{E} \hookrightarrow J^\infty_\Sigma(E)$ (51). Hence by def. the statement is equivalently that the ordinary flow (prop. ) of $\hat v$ (def. ) on the jet bundle $J^\infty_\Sigma(E)$ preserves the shell. This in turn means that it preserves the vanishing locus of the Euler-Lagrange form $\delta_{EL} \mathbf{L}$ , which is the case by prop. .

$\,$

infinitesimal symmetries of the presymplectic potential current

Evidently Noether's theorem I in variational calculus (prop. ) is the special case for horizontal $p+1$ -forms of a more general phenomenon relating symmetries of variational forms to forms that are closed up to a contraction. The same phenomenon applied instead to the presymplectic current yields the following:

Definition

(variational Lie derivative)

Let $E \overset{fb}{\to} \Sigma$ be a field bundle (def. ) with jet bundle $J^\infty_\Sigma(E)$ (def. ).

For $v$ a vertical tangent vector field on the jet bundle (a variation def. ) write

(83)

\mathcal{L}^{var}_{v} \;\coloneqq\; \delta \circ \iota_v + \iota_v \circ \delta

for the variational Lie derivative along $v$ , analogous to Cartan's homotopy formula (prop. ) but defined in terms of the variational derivative $\delta$ (35) as opposed to the full de Rham differential.

Then for $v_1$ and $v_2$ two vertical vector fields, write

[v_1, v_2]^{var} \;\in \; \Gamma( T_{vert} J^\infty_\Sigma(E) )

for the vector field whose contraction operator (def. ) is given by

\begin{aligned} \iota_{[v_1,v_2]^{var}} & = \left[ \mathcal{L}^{var}_{v_1}, \iota_{v_2} \right] \\ & \coloneqq \mathcal{L}^{var}_{v_1} \circ \iota_{v_2} - \iota_{v_2} \circ \mathcal{L}^{var}_{v_1} \end{aligned} \,,

Definition

(infinitesimal symmetry of the presymplectic potential and Hamiltonian differential forms)

Let $(E,\mathbf{L})$ be a Lagrangian field theory (def. ) with presymplectic potential current $\Theta_{BFV}$ (53). Write $\mathcal{E} \hookrightarrow J^\infty_\Sigma(E)$ for the shell (51).

Then:

An on-shell variation $v$ (def. ) is an infinitesimal symmetry of the presymplectic current or Hamiltonian vector field if on-shell (def. ) its variational Lie derivative along $v$ (def. ) is a variational derivative:

$(\delta \circ \iota_v + \iota_v \circ \delta) \Theta_{BFV} = \delta \tilde H_v \phantom{AAA} \text{on}\, \mathcal{E}$

for some variational form $\tilde H_v$ .
A Hamiltonian differential form $H$ (or local Hamiltonian current) is a variational form on the shell such that there exists a variation $v$ with

$\delta H = \iota_v \Omega_{BFV} \phantom{AA} \, \text{on}\, \mathcal{E} \,.$

We write

\Omega^{p,0}_{\Sigma, Ham}(E) \;\coloneqq\; \left\{ (H,v) \;\vert\; v \, \text{is a variation and}\, \iota_v \Omega_{BFV} = \delta H \right\}

for the space of pairs consisting of a Hamiltonian differential forms on-shell and a corresponding variation.

Proposition

(Hamiltonian Noether's theorem)

A variation $v$ is an infinitesimal symmetry of the presymplectic potential (def. ) with $\mathcal{L}^{var}_v ( \Theta_{BFV} ) = \delta \tilde H_v$ precisely if

H_v \coloneqq \tilde H_v - \iota_v \Theta_{BFV}

is a Hamiltonian differential form for $v$ .

Proof

From the definition (83) of $\mathcal{L}^{var}_v$ we have

\begin{aligned} & \mathcal{L}^{var}_v \Theta_{BFV} = \delta \tilde H_v \\ \Leftrightarrow\;\; & \delta \iota_v \Theta_{BFV} + \iota_v \underset{= \Omega_{BFV}}{\underbrace{\delta \Theta_{BFV}}} = \delta \tilde H_v \\ \Leftrightarrow\;\; & \delta \left( \tilde H_v - \iota_v \Theta_{BFV} \right) = \iota_v \Omega_{BFV} \,, \end{aligned}

where we used the definition (55) of $\Omega_{BFV}$ .

$\,$

Since therefore both the conserved currents from Noether's theorem as well as the Hamiltonian differential forms are generators of infinitesimal symmetries of certain variational forms (namely of the Lagrangian density and of the presymplectic current, respectively) they form a Lie algebra. For the conserved currents this is sometimes known as the Dickey bracket Lie algebra. For the Hamiltonian forms it is the Poisson bracket Lie p+1-algebra. Since here for simplicity we are considering just vertical variations, we have just a plain Lie algebra. The transgression of this Lie algebra of Hamiltonian forms on the jet bundle to Cauchy surfaces yields a presymplectic structure on phase space, this we discuss below.

Proposition

(local Poisson bracket)

Let $(E,\mathbf{L})$ be a Lagrangian field theory (def. ).

On the space $\Omega^{p,0}_{\Sigma,Ham}(E)$ pairs $(H,v)$ of Hamiltonian differential forms $H$ with compatible variation $v$ (def. ) the following operation constitutes a Lie bracket:

(84)

\left\{(H_1, v_1),\, (H_2, v_2)\right\} \;\coloneqq\; (\iota_{v_1} \iota_{v_2} \Omega_{BFV},\, [v_1,v_2]^{var}) \,,

where $[v_1, v_2]^{var}$ is the variational Lie bracket from def. .

We call this the local Poisson Lie bracket.

Proof

First we need to check that the bracket is well defined in itself. It is clear that it is linear and skew-symmetric, but what needs proof is that it does indeed land in $\Omega^{p,0}_{\Sigma,Ham}(E)$ , hence that the following equation holds:

\delta \iota_{v_2} \iota_{v_1} \Omega_{BFV} \;=\; \iota_{[v_1, v_2]^{var}} \Omega_{BFV} \,.

With def. for $\mathcal{L}^{var}$ and $[-,-]^{var}$ we compute this as follows:

\begin{aligned} \delta \iota_{v_1} \iota_{v_2} \Omega_{BFV} & = \tfrac{1}{2} \delta \iota_{v_1} \iota_{v_2} \Omega_{BFV} - \tfrac{1}{2} (v_1 \leftrightarrow v_2) \\ & = \tfrac{1}{2} \left( \mathcal{L}^{var}_{v_1} \iota_{v_2} \Omega_{BFV} - \iota_{v_1} \delta \iota_{v_2} \Omega_{BFV} \right) - \tfrac{1}{2} (v_1 \leftrightarrow v_2) \\ & = \tfrac{1}{2} \left( \mathcal{L}^{var}_{v_1} \iota_{v_2} \Omega_{BFV} - \iota_{v_1} \mathcal{L}^{var}_{v_2} \Omega_{BFV} + \iota_{v_1} \iota_{v_2} \underset{= 0}{\underbrace{\delta \Omega_{BFV}}} \right) - \tfrac{1}{2} (v_1 \leftrightarrow v_2) \\ & = [\mathcal{L}^{var}_{v_2}, \iota_{v_1}] \Omega_{BFV} \\ & = \iota_{[v_1, v_2]^{var}} \Omega_{BFV} \,. \end{aligned}

This shows that the bracket is well defined.

It remains to see that the bracket satifies the Jacobi identity:

\left\{ (H_1, v_1), \left\{ (H_2, v_2), (H_3,v_3) \right\} \right\} \;+\; (cyclic) \;=\; 0

hence that

\left( \iota_{v_1} \iota_{[v_2,v_3]^{var}} \Omega_{BFV} ,\, [v_1, [v_2, v_2]^{var}]^{var} \right) \;+\; (cyclic) \;=\; 0 \,.

Here $[v_1, [v_2, v_3]^{var}]^{var} + (cyclic) = 0$ holds because by def. $[v_1,-]^{var}$ acts as a derivation, and hence what remains to be shown is that

\iota_{v_1} \iota_{\left([v_2, v_3]^{var}\right)} \Omega_{BFV} + (cyclic) = 0

We check this by repeated uses of def. , using in addition that

$\delta \iota_{v_i} \Omega_{BFV} = 0$

(since $\iota_{v_i} \Omega_{BFV} = \delta H_i$ by $v_i$ being Hamiltonian)
$\mathcal{L}^{var}_{v_i} \Omega_{BFV} = 0$

(since in addition $\delta \Omega_{BFV} = 0$ )
$\iota_{v_1} \iota_{v_2} \iota_{v_3} \Omega_{BFV} = 0$

(since $\Omega_{BFV} \in \Omega^{p,2}_\Sigma(E)$ is of vertical degree 2, and since all variations $v_i$ are vertical by assumption).

So we compute as follows (a special case of FRS 13b, lemma 3.1.1):

\begin{aligned} 0 & = \delta \iota_{v_1} \iota_{v_2} \iota_{v_3} \Omega_{BFV} \\ & = \mathcal{L}^{var}_{v_1} \iota_{v_2} \iota_{v_3} \Omega_{BFV} - \iota_{v_1} \delta \iota_{v_2} \iota_{v_3} \Omega_{BFV} \\ & = \iota_{[v_1, v_2]^{var}} \iota_{v_3} \Omega_{BFV} + \iota_{v_2} \mathcal{L}^{var}_{v_1} \iota_{v_3} \Omega_{BFV} - \iota_{v_1} \mathcal{L}^{var}_{v_2} \iota_{v_3} \Omega_{BFV} + \iota_{v_1} \iota_{v_2} \delta \iota_{v_3} \Omega_{BFV} \\ & = \iota_{[v_1, v_2]^{var}} \iota_{v_3} \Omega_{BFV} + \iota_{v_2} \iota_{[v_1,v_3]^{var}} \Omega_{BFV} - \iota_{v_1} \iota_{[v_2, v_3]^{var}} \Omega_{BFV} \\ & = - \iota_{v_1} \iota_{[v_2, v_3]^{var}} \Omega_{BFV} - \iota_{v_2} \iota_{[v_3, v_1]^{var}} \Omega_{BFV} - \iota_{v_3} \iota_{[v_1, v_2]^{var}} \Omega_{BFV} \,. \end{aligned}

$\,$

The local Poisson bracket Lie algebra $(\Omega^{p,0}_{\Sigma,Ham}(E), [-,-]^{var})$ from prop. is but the lowest stage of a higher Lie theoretic structure called the Poisson bracket Lie p-algebra. Here we will not go deeper into this higher structure (see at Higher Prequantum Geometry for more), but below we will need the following simple shadow of it:

Lemma

The horizontally exact Hamiltonian forms constitute a Lie ideal for the local Poisson Lie bracket (84).

Proof

Let $E$ be a horizontally exact Hamiltonian form, hence

E = d K

for some $K$ . Write $e$ for a Hamiltonian vector field for $E$ .

Then for $(H,v)$ any other pair consisting of a Hamiltonian form and a corresponding Hamiltonian vector field, we have

\begin{aligned} \iota_v \, \iota_e \, \Omega_{BFV} & = \phantom{-}\iota_v \, \delta E \\ & = \phantom{-}\iota_v \, \delta \, d \, K \\ & = - \iota_v \, d \, \delta K \\ & = \phantom{-}d \, \iota_v \, \delta \, K \,. \end{aligned}

Here we used that the horizontal derivative anti-commutes with the vertical one by construction of the variational bicomplex, and that $\iota_e$ anti-commutes with the horizontal derivative $d$ since the variation $e$ (def. ) is by definition vertical.

Example

(local Poisson bracket for real scalar field)

Consider the Lagrangian field theory for the free real scalar field from example .

By example its presymplectic current is

\Omega_{BFV} = \eta^{\mu \nu} \delta \phi_{,\mu} \wedge \delta \phi \wedge \iota_{\partial_\mu} dvol_\Sigma \,

The corresponding local Poisson bracket algebra (prop. ) has in degree 0 Hamiltonian forms (def. ) such as

Q \;\coloneqq\; \phi \,\iota_{\partial_0} dvol_\Sigma \in \Omega^{p,0}(E)

and

P \;\coloneqq\; \eta^{\mu \nu} \phi_{,\mu} \, \iota_{\partial_\nu} dvol_{\Sigma} \in \Omega^{p,0}(E) \,.

The corresponding Hamiltonian vector fields are

v_Q = -\partial_{\phi_{,0}}

and

v_P = - \partial_{\phi} \,.

Hence the corresponding local Poisson bracket is

\{P,Q\} = \iota_{v_P} \iota_{v_Q} \omega = \iota_{\partial_0} dvol_\Sigma \,.

More generally for $b_1, b_2 \in C^\infty_{cp}(\Sigma)$ two bump functions then

\{ b_1 P, b_2 Q \} = b_1 b_2 \iota_{\partial_0} dvol_\Sigma \,.

Example

(local Poisson bracket for free Dirac field)

Consider the Lagrangian field theory of the free Dirac field on Minkowski spacetime (example ), whose presymplectic current is, according to example , given by

(85)

\Omega_{BFV} \;=\; (\overline{\delta \psi}) \, \gamma^\mu \, (\delta \psi) \, \iota_{\partial_\mu} dvol_\Sigma \,.

Consider this specifically in spacetime dimension $p + 1 = 4$ in which case the components $\psi_\alpha$ are complex number-valued (by prop./def. ), so that the tuple $(\psi_\alpha)$ amounts to 8 real-valued coordinate functions. By changing complex coordinates, we may equivalently consider $(\psi_\alpha)$ as four coordinate functions, and $(\overline{\psi}^\alpha)$ as another four independent coordinate functions.

Using this coordinate transformation, it is immediate to find the following pairs of Hamiltonian vector fields and their Hamiltonian differential forms from def. applied to (85)

Hamiltonian vector field	Hamiltonian differential form
$\phantom{AA} \partial_{\psi_\alpha}$	$\phantom{AA}\left(\overline{\delta \psi}\gamma^\mu\right)^\alpha\, \iota_{\partial_\mu} dvol_\Sigma$
$\phantom{AA} \partial_{\overline{\psi}_\alpha}$	$\phantom{AA}\left( \gamma^\mu \psi \right)_\alpha \, \iota_{\partial_\mu} dvol_\Sigma$

and to obtain the following non-trivial local Poisson brackets (prop. ) (the other possible brackets vanish):

\left\{ \left( \gamma^\mu \psi \right)_\alpha \, \iota_{\partial_\mu} dvol_\Sigma \,,\, \left(\overline{\psi}\gamma^\mu\right)^\beta\, \iota_{\partial_\mu} dvol_\Sigma \right\} \;=\; \left(\gamma^\mu\right)_\alpha{}^{\beta} \, \iota_{\partial_\mu} dvol_\Sigma \,.

Notice the signs: Due to the odd-grading of the field coordinate function $\psi$ , its variational derivative $\delta \psi$ has bi-degree $(1,odd)$ and the contraction operation $\iota_{\psi}$ has bi-degree $(-1,odd)$ , so that commuting it past $\overline{\psi}$ picks up two minus signs, a “cohomological” sign due to the differential form degrees, and a “supergeometric” one (def. ):

\iota_{\partial_\psi} \overline{\delta \psi} \cdots = (-1) (-1) \overline{\delta \psi} \,\iota_{\partial_\psi} \cdots \,.

For the same reason, the local Poisson bracket is a super Lie algebra with symmetric super Lie bracket:

\left\{ \left( \gamma^\mu \psi \right)_\alpha \, \iota_{\partial_\mu} dvol_\Sigma \,,\, \left(\overline{\psi}\gamma^\mu\right)^\beta\, \iota_{\partial_\mu} dvol_\Sigma \right\} \;=\; + \left\{ \left(\overline{\psi}\gamma^\mu\right)^\beta\, \iota_{\partial_\mu} dvol_\Sigma \,,\, \left( \gamma^\mu \psi \right)_\alpha \, \iota_{\partial_\mu} dvol_\Sigma \right\} \,.

$\,$

This concludes our discussion of general infinitesimal symmetries of a Lagrangian. We pick this up again in the discussion of Gauge symmetries below. First, in the next chapter we discuss the concept of observables in field theory.

Observables

In this chapter we discuss these topics:

General observables
Polynomial off-shell observables and Distributions
Polynomial on-shell observables and Distributional solutions to PDEs
Local observables and Transgression
Infinitesimal observables
States

$\,$

Given a Lagrangian field theory (def. ), then a general observable quantity or just observable for short (def. below), is a smooth function

A \;\colon\; \Gamma_\Sigma(E)_{\delta_{EL}\mathbf{L} = 0} \longrightarrow \mathbb{C}

on the on-shell space of field histories (example , example ) hence a smooth “functional” of field histories. We think of this as assigning to each physically realizable field history $\Phi$ the value $A(\Phi)$ of the given quantity as exhibited by that field history. For instance concepts like “average field strength in the compact spacetime region $\mathcal{O}$ ” should be observables. In particular the field amplitude at spacetime point $x$ should be an observable, the “field observable” denoted $\mathbf{\Phi}^a(x)$ .

Beware that in much of the literature on field theory, these point-evaluation field observables $\mathbf{\Phi}^a(x)$ (example below ) are eventually referred to as “fields” themselves, blurring the distinction between

types of fields/field bundles $E$ ,
field histories/sections $\Phi$ ,
functions on the space of field histories $\mathbf{\Phi}^a(x)$ .

In particular, the process of quantization (discussed in Quantization below) affects the third of these concepts only, in that it deforms the algebra structure on observables to a non-commutative algebra of quantum observables. For this reason the field observables $\mathbf{\Phi}^a(x)$ are often referred to as quantum fields. But to understand the conceptual nature of quantum field theory it is important that the $\mathbf{\Phi}^a(x)$ are really the observables or quantum observables on the space of field histories.

fields

aspect	term	type	description	def.
field component	$\phi^a$ , $\phi^a_{,\mu}$	$J^\infty_\Sigma(E) \to \mathbb{R}$	coordinate function on jet bundle of field bundle	def. , def.
field history	$\Phi$ , $\frac{\partial \Phi}{\partial x^\mu}$	$\Sigma \to J^\infty_\Sigma(E)$	jet prolongation of section of field bundle	def. , def.
field observable	$\mathbf{\Phi}^a(x)$ , $\partial_{\mu} \mathbf{\Phi}^a(x),$	$\Gamma_{\Sigma}(E) \to \mathbb{R}$	derivatives of delta-functional on space of sections	def. , example
averaging of field observable	$\alpha^\ast \mapsto \underset{\Sigma}{\int} \alpha^\ast_a(x) \mathbf{\Phi}^a(x) \, dvol_\Sigma(x)$	$\Gamma_{\Sigma,cp}(E^\ast) \to Obs(E_{scp},\mathbf{L})$	observable-valued distribution	def.
algebra of quantum observables	$\left( Obs(E,\mathbf{L})_{\mu c},\, \star\right)$	$\mathbb{C}Alg$	non-commutative algebra structure on field observables	def. , def.

$\,$

There are various further conditions on observables which we will eventually consider, forming subspaces of gauge invariant observables (def. ), local observables (def. below), Hamiltonian local observables (def. below) and microcausal observables (def. ). While in the end it is only these special kinds of observables that matter, it is useful to first consider the unconstrained concept and then consecutively characterize smaller subspaces of well-behaved observables. In fact it is useful to consider yet more generally the observables on the full space of field histories (not just the on-shell subspace), called the off-shell observables.

In the case that the field bundle is a vector bundle (example ), the off-shell space of field histories is canonically a vector space and hence it makes sense to consider linear off-shell observables, i.e. those observables $A$ with $A(c \Phi) = c A(\Phi)$ and $A(\Phi_1 + \Phi_2) = A(\Phi_1) + A(\Phi_2)$ . It turns out that these are precisely the compactly supported distributions in the sense of Laurent Schwartz (prop. below). This fact makes powerful tools from functional analysis and microlocal analysis available for the analysis of field theory (discussed below).

More generally there are the multilinear off-shell observables, and these are analogously given by distributions of several variables (def. below). In fully perturbative quantum field theory one considers only the infinitesimal neighbourhood (example ) of a single on-shell field history and in this case all observables are in fact given by such multilinear observables (def. below).

For a free field theory (def. ) whose Euler-Lagrange equations of motion are given by a linear differential operator which behaves well in that it is “Green hyperbolic” (def. below) it follows that the actual on-shell linear observables are equivalently those off-shell observables which are spatially compactly supported distributional solutions to the formally adjoint equation of motion (prop. below); and this equivalence is exhibited by composition with the causal Green function (def. below):

This is theorem below, which is pivotal for passing from classical field theory to quantum field theory:

\left\{ \,\, \array{ \text{polynomial} \\ \text{on-shell} \\ \text{observables} } \,\, \right\} \underoverset{\simeq}{\text{restriction}}{\longleftarrow} \left\{ \array{ \text{polynomial} \\ \text{off-shell} \\ \text{observables} \\ \text{modulo equations of motion} } \right\} \underoverset{\simeq}{\text{causal propagator}}{\longleftarrow} \left\{ \array{ \text{spatially compactly supported} \\ \text{distributions in several variables} \\ \text{which are distributional solutions} \\ \text{to the adjoint equations of motion} } \right\}

This fact makes, in addition, the distributional analysis of linear differential equations available for the analysis of free field theory, notably the theory of propagators, such as Feynman propagators (def. below), which we turn to in Propagators below.

The functional analysis and microlocal analysis (below) of linear observables re-expressed in distribution theory via theorem solves the issues that the original formulation of perturbative quantum field theory by Schwinger-Tomonaga-Feynman-Dyson in the 1940s was notorious for suffering from (Feynman 85): The normal ordered product of quantum observables in a Wick algebra of observables follows from Hörmander's criterion for the product of distributions to be well-defined (this we discuss in Free quantum fields below) and the renormalization freedom in the construction of the S-matrix is governed by the mechanism of extensions of distributions (this we discuss in Renormalization below).

Among the polynomial on-shell observables characterized this way, the focus is furthermore on the local observables:

In local field theory the idea is that both the equations of motion as well as the observations are fully determined by their restriction to infinitesimal neighbourhoods of spacetime points (events). For the equations of motion this means that they are partial differential equations as we have seen above. For the observables it should mean that they must be averages over regions of spacetime of functions of the value of the field histories and their derivatives at any point of spacetime. Now a “smooth function of the value of the field histories and their derivatives at any point” is precisely a smooth function on the jet bundle of the field bundle (example ) pulled back via jet prolongation (def. ). If this is to be averaged over spacetime it needs to be the coefficient of a horizontal $p+1$ -form (prop. ).

In mathematical terminology these desiderata say that the local observables in a local field theory should be precisely the “transgressions” (def. below) of horizontal variational $p+1$ -forms (with compact spacetime support, def. below) to the space of field histories (example ). This is def. below.

A key example of a local observable in Lagrangian field theory (def. ) is the action functional (example below). This is the transgression of the Lagrangian density itself, or rather of its product with an “adiabatic switching function” that localizes its support in a compact spacetime region. In typical cases the physical quantity whose observation is represented by the action functional is the difference of the kinetic energy-momentum minus the potential energy of a field history averaged over the given region of spacetime.

The equations of motion of a Lagrangian field theory say that those field histories are physically realized which are critical points of this action functional observable. This is the principle of extremal action (prop. below).

In summary we find the following system of types of observables:

types of observables in perturbative quantum field theory:

\array{ && \text{local} \\ && & \searrow \\ \text{field} &\longrightarrow& \text{linear} &\longrightarrow& \text{microcausal} &\longrightarrow& \text{polynomial} &\longrightarrow& \text{general} \\ && & \nearrow \\ && \text{regular} }

In the chapter Free quantum fields we will see that the space of all polynomial observables is too large to admit quantization, while the space of regular local observables is too small to contain the usual interaction terms for perturbative quantum field theory (example ) below. The space of microcausal polynomial observables (def. below) is in between these two extremes, and evades both of these obstacles.

$\,$

Given the concept of observables, it remains to formalize what it means for the physical system to be in some definite state so that the observable quantities take some definite value, reflecting the properties of that state.

Whatever formalization for states of a field theory one considers, at the very least the space of states $States$ should come with a pairing linear map

\array{ Obs \otimes States & \longrightarrow& \mathcal{C} \\ \left( A , \langle - \rangle \right) &\mapsto& \langle A \rangle }

which reads in an observable quantity $A$ and a state, to be denoted $\langle - \rangle$ , and produces the complex number $\langle A \rangle$ which is the “value of the observable quantity $A$ in the case that the physical system is in the state $\langle -\rangle$ ”.

One might imagine that it is fundamentally possible to pinpoint the exact field history that the physical system is found in. From this perspective, fixing a state should simply mean to pick such a field history, namely an element $\Phi \in \Gamma_{\Sigma}(E)_{\delta_{EL}\mathbf{L} = 0}$ in the on-shell space of field histories. If we write $\langle -\rangle_{\Phi}$ for this state, its pairing map with the observables would simply be evaluation of the observable, being a function on the field history space, on that particular element in this space:

\langle A \rangle_{\Phi} \coloneqq A(\Phi) \,.

However, in the practice of experiment a field history can never be known precisely, without remaining uncertainty. Moreover, quantum physics (to which we finally come below), suggests that this is true not just in practice, but even in principle. Therefore we should allow states to be a kind of probability distributions on the space of field histories, and regard the pairing $\langle A \rangle$ of a state $\langle - \rangle$ with an observable $A$ as a kind of expectation value of the function $A$ averaged with respect to this probability distribution. Specifically, if the observable quantity $A$ is (a smooth approximation to) a characteristic function of a subset $S \subset \Gamma_{\Sigma}(E)_{\delta_{EL}\mathbf{L} = 0}$ of the space of field histories, then its value in a given state should be the probability to find the physical system in that subset of field histories.

But, moreover, the superposition principle of quantum physics says that the actually observable observables are only those of the form $A^\ast A$ (for $A^\ast$ the image under the star-operation on the star algebra of observables.

This finally leads to the definition of states in def. below.

$\,$

General observables

Definition

(observables)

Let $(E,\mathbf{L})$ be a Lagrangian field theory (def. ) with $\Gamma_\Sigma(E)_{\delta_{EL} \mathbf{L} = 0}$ its on-shell space of field histories (def. ).

Then the space of observables is the super formal smooth set (def. ) which is the mapping space

Obs(E,\mathbf{L}) \;\coloneqq\; \left[ \Gamma_\Sigma(E)_{\delta_{EL}\mathbf{L} = 0} \,,\, \mathbb{C} \right]

from the on-shell space of field histories to the complex numbers.

Similarly there is the space of off-shell observables

(86)

Obs(E) \;\coloneqq\; \left[ \Gamma_\Sigma(E) \,,\, \mathbb{C} \right] \,.

Every off-shell observables induces an on-shell observable by restriction, this yields a smooth function

(87)

Obs(E) \overset{(-)_{\delta_{EL}\mathbf{L} = 0}}{\longrightarrow} Obs(E,\mathbf{L})

similarly we may consider the observables on the sup-spaces of field histories with restricted causal support according to def. . We write

Obs(E_{scp}) \;\coloneqq\; \left[ \Gamma_{\Sigma,scp}(E), \mathbb{C} \right]

and

(88)

Obs(E_{scp}, \mathbf{L}) \;\coloneqq\; \left[ \Gamma_{\Sigma,scp}(E)_{\delta_{EL} \mathbf{L} = 0}, \mathbb{C} \right]

for the spaces of (off-shell) observables on field histories with spatially compact support (def. ).

$\,$

Observables form a commutative algebra under pointwise product:

(89)

\array{ Obs(E) \otimes Obs(E) &\overset{(-)\cdot (-)}{\longrightarrow}& Obs(E) \\ (A_1, A_2) &\mapsto& A_1 \cdot A_2 }

given by

(A_1 \cdot A_2)(\Phi_{(-)}) \coloneqq A_1(\Phi_{(-)}) \cdot A_2(\Phi_{(-)}) \,,

where on the right we have the product in $\mathbb{C}$ .

(Suitable subspaces of observables will in addition carry other products, notably non-commutative algebra structures, this is the topic of the chapters Free quantum fields and Quantum observables below.)

$\,$

Observables on bosonic fields

In the case that $E$ is a purely bosonic field bundle in smooth manifolds so that $\Gamma_\Sigma(E)_{\delta_{EL}\mathbf{L} = 0}$ is a diffeological space (def. , def. ) this means that a single observable $A \in Obs_{E,\mathbf{L}}$ is equivalently a smooth function (def. )

A \;\colon\; \Gamma_\Sigma(E)_{\delta_{EL} \mathbf{L} = 0} \longrightarrow \mathbb{C} \,.

Explicitly, by def. (and similarly by def. ) this means that $A$ is for each Cartesian space $U$ (generally: super Cartesian space, def. ) a natural function of plots

A_U \;\colon\; \left\{ \array{ U \times \Sigma && \overset{\Phi_{(-)}}{\longrightarrow} && E \\ & {}_{\mathllap{pr_2}}\searrow && \swarrow_{\mathrlap{fb}} \\ && \Sigma } \right\}_{\delta_{EL}\mathbf{L} = 0} \;\overset{}{\longrightarrow}\; \left\{ U \to \mathbb{C} \right\} \,.

Observables on fermionic fields

In the case that $E$ has purely fermionic fibers (def. ), such as for the Dirac field (example ) with $E = \Sigma\times S_{odd}$ then the only points in $Obs_{E}$ , namely morphisms $\mathbb{R}^0 \to Obs_E$ are observables depending on an even power of field histories; while general observables appear as possibly odd-parameterized families

(\theta \mapsto \theta \Psi) \;\colon\; \mathbb{R}^{0\vert 1} \longrightarrow Obs_{E,\mathbf{L}}

whose component $\mathbf{\Psi}$ is a section of the even-graded field bundle, regarded in odd degree, via prop. . See example below.

The most basic kind of observables are the following:

Example

(point evaluation observables – field observables)

Let $(E,\mathbf{L})$ be a Lagrangian field theory (def. ) whose field bundle (def. ) over some spacetime $\Sigma$ happens to be a trivial vector bundle in even degree (i.e. bosonic) with field fiber coordinates $(\phi^a)$ (example ). With respect to these coordinates a field history, hence a section of the field bundle

\Phi \;\in \; \Gamma_\Sigma(E)

has components $(\Phi^a)$ which are smooth functions on spacetime.

Then for every index $a$ and every point $x \in \Sigma$ in spacetime (every event) there is an observable (def. ) denoted $\mathbf{\Phi}^a(x)$ which is given by

\mathbf{\Phi}^a(x) \;\colon\; \Phi_{(-)} \mapsto \Phi_{(-)}^a(x) \,,

hence which on a test space $U$ (a Cartesian space or more generally super Cartesian space, def. ) sends a $U$ -parameterized collection of fields

\Phi_{(-)} \colon U \to \Gamma_\Sigma(E)_{\delta_{EL}\mathbf{L} = 0}

to their $U$ -parameterized collection of values at $x$ of their $a$ -th component.

Notice how the various aspects of the concept of “field” are involved here, all closely related but crucially different:

\array{ \mathbf{\Phi}^a(x) &\colon& \Phi &\overset{\phantom{AA}}{\mapsto}& \Phi^a(x) &=& \phi^a & \circ \Phi(x) \\ \array{ \text{field} \\ \text{observable} } && \array{ \text{field} \\ \text{history} } && \array{ \text{field} \\ \text{value} } && \array{ \text{field} \\ \text{component} } }

$\,$

Polynomial off-shell Observables and Distributions

We consider here linear observables (def. below) and more generally quadratic observables (def. ) and generally polynomial observables (def. below) for free field theories and discuss how these are equivalently given by integration against generalized functions called distributions (prop. and prop. below).

This is the basis for the discussion of quantum observables for free field theories further below.

$\,$

Definition

(linear off-shell observables)

Let $(E,\mathbf{L})$ be a Lagrangian field theory (def. ) whose field bundle $E$ (def. ) is a super vector bundle (as in example and as opposed to more general non-linear fiber bundles).

This means that the off-shell space of field histories $\Gamma_\Sigma(E)$ (example ) inherits the structure of a super vector space by spacetime-pointwise (i.e. event-wise) scaling and addition of field histories.

Then an off-shell observable (def. )

A \;\colon\; \Gamma_\Sigma(E) \longrightarrow \mathbb{C}

is a linear observable if it is a linear function with respect to this vector space structure, hence if

A\left( c \Phi_{(-)}) = c A(\Phi_{(-)} \right) \phantom{AAAA} \text{and} \phantom{AAAA} A\left(\Phi_{(-)} + \Phi'_{(-)} \right) = A\left( \Phi_{(-)}) + A(\Phi'_{(-)} \right)

for all plots of field histories $\Phi_{(-)}, \Phi'_{(-)}$ .

If moreover $(E,\mathbf{L})$ is a free field theory (def. ) then the on-shell space of field histories inherits this linear structure and we may similarly speak of linear on-shell observables.

We write

LinObs(E,\mathbf{L}) \hookrightarrow Obs(E,\mathbf{L})

for the subspace of linear observables inside all observables (def. ) and similarly

LinObs(E) \hookrightarrow Obs(E)

for the linear off-shell observables inside all off-shell observables, and similarly for the subspaces of linear observables on field histories of spatially compact supprt (88):

(90)

LinObs(E_{scp}, \mathbf{L}) \hookrightarrow Obs(E_{scp}, \mathbf{L})

and

LinObs(E_{scp}) \hookrightarrow Obs(E_{scp}) \,.

Example

(point evaluation observables are linear)

Let $(E,\mathbf{L})$ be a Lagrangian field theory (def. ) over Minkowski spacetime (def. ), whose field bundle $E$ (def. ) is the trivial vector bundle with field coordinates $(\phi^a)$ (example ).

Then for each field component index $a$ and point $x \in \Sigma$ of spacetime (each event) the point evaluation observable (example )

\array{ \Gamma_\Sigma(E)_{\delta_{EL}\mathbf{L} = 0} &\overset{\mathbf{\Phi}^a(x)}{\longrightarrow}& \mathbb{C} \\ \phi &\mapsto& \phi^a(x) }

is a linear observable according to def. . The distribution that it corresponds to under prop. is the Dirac delta-distribution at the point $x$ combined with the Kronecker delta on the index $a$ : In the generalized function-notation of remark this reads:

\Phi^a(x) \;\colon\; \Phi \mapsto \int_\Sigma \Phi^b(y) \delta_b^a \delta(x,y) \, dvol_\Sigma(y) \,.

Proposition

(linear off-shell observables of scalar field are the compactly supported distributions)

Let $(E,\mathbf{L})$ be a Lagrangian field theory (def. ) over Minkowski spacetime (def. ), whose field bundle $E$ (def. ) is the trivial real line bundle (as for the real scalar field, example ). This means that the off-shell space of field histories $\Gamma_\Sigma(E) \simeq C^\infty(\Sigma)$ (19) is the real vector space of smooth functions on Minkowski spacetime and that every linear observable $A$ (def. ) gives a linear function

A_\ast \;\colon\; C^\infty(\Sigma)_{\delta_{EL}\mathbf{L} = 0} \longrightarrow \mathbb{C} \,.

This linear function $A_\ast$ is in fact a compactly supported distribution, in the sense of functional analysis, in that it satisfies the following Fréchet vector space continuity condition:

Fréchet continuous linear functional

A linear function $A_\ast \;\colon\; C^\infty(\mathbb{R}^{p,1}) \to \mathbb{R}$ is called continuous if there exists
1. a compact subset $K \subset \mathbb{R}^{p,1}$ of Minkowski spacetime;
2. a natural number $k \in \mathbb{N}$ ;
3. a positive real number $C \in \mathbb{R}_+$
such that for all on-shell field histories

$\Phi \in C^\infty(\Sigma)_{\delta_{EL}\mathbf{L} = 0}$

the following inequality of absolute values ${\vert -\vert}$ of partial derivatives holds

${\vert A_\ast(\Phi)\vert} \;\leq\; C \underset{{\vert \alpha \vert} \leq k}{\sum} \, \underset{x \in K}{sup} {\vert \partial^\alpha \Phi(x)\vert} \,,$

where the sum is over all multi-indices $\alpha \in \mathbb{N}^{p+1}$ (1) whose total degree ${\vert \alpha\vert} \coloneqq \alpha_0 + \cdots + \alpha_{p}$ is bounded by $k$ , and where

$\partial^\alpha \Phi \;\coloneqq\; \frac{\partial^{{\vert \alpha\vert}} \Phi }{ \partial^{\alpha_0} x^0 \partial^{\alpha_1} x^1 \cdots \partial^{\alpha^p} x^p }$

denotes the corresponding partial derivative (1).

This identification constitutes a linear isomorphism

\array{ LinObs(\Sigma \times \mathbb{R}) &\overset{\simeq}{\longrightarrow}& \mathcal{E}'(\Sigma) \\ \array{ \text{linear off-shell} \\ \text{observables} \\ \text{of the scalar field} } && \array{ \text{compactly supported} \\ \text{distributions} \\ \text{on spacetime} } } \,,

saying that all compactly supported distributions arise from linear off-shell observables of the scalar field this way, and uniquely so.

For proof see at distributions are the smooth linear functionals, this prop.

The identification from prop. of linear off-shell observables with compactly supported distributions makes available powerful tools from functional analysis. The key fact is the following:

Proposition

(distributions are generalized functions)

For $n \in \mathbb{N}$ , every compactly supported smooth function $b \in C^\infty_{cp}(\mathbb{R}^n)$ on the Cartesian space $\mathbb{R}^n$ induces a distribution (prop. ), hence a continuous linear functional, by integration against $b$ times the volume form.

\array{ C^\infty(\mathbb{R}^n) &\longrightarrow& \mathbb{R} \\ f &\mapsto& \int_{\mathbb{R}^n} f(x) b(x) \, dvol(x) }

The distributions arising this way are called the non-singular distributions.

This construction is clearly a linear inclusion

C^\infty_{cp}(\mathbb{R}^n) \overset{\phantom{AAA}}{\hookrightarrow} \mathcal{E}'(\mathbb{R}^n)

and in fact this is a dense subspace inclusion for the space of compactly supported distributions $\mathcal{E}'(\mathbb{R}^n)$ equipped with the dual space topology (this def.) to the Fréchet space structure on $C^\infty(\mathbb{R}^n)$ from prop. .

Hence every compactly supported distribution $u$ is the limit of a sequence $\{b_n\}_{n \in \mathbb{N}}$ of compactly supported smooth functions in that for every smooth function $f \in C^\infty(\mathbb{R}^n)$ we have that the value $u(f) \in \mathbb{R}$ is the limit of integrals against $b_n dvol$ :

u(f) \;=\; \underset{n \to \infty}{\lim}\, \int_{\mathbb{R}^n} f(x) b_n(x) dvol(x) \,.

(e. g. Hörmander 90, theorem 4.1.5)

Proposition with prop. implies that with due care we may think of all linear off-shell observables as arising from integration of field histories against some “generalized smooth functions” (namely a limit of actual smooth functions):

Remark

(linear off-shell observables of real scalar field as integration against generalized functions)

Let $(E,\mathbf{L})$ be a Lagrangian field theory (def. ) over Minkowski spacetime (def. ), whose field bundle $E$ (def. ) is a trivial vector bundle with field coordinates $(\phi^a)$ .

Prop. implies immediately that in this situation linear off-shell observables $A$ (def. ) correspond to tuples $(A_a)$ of compactly supported distributions via

A(\Phi) = \underset{a}{\sum} A_a(\Phi^a) \,.

With prop. it follows furthermore that there is a sequence of tuples of smooth functions $\{(\alpha_n)_{a}\}_{n \in \mathbb{N}}$ such that $A_a$ is the limit of the integrations against these:

A(\Phi) \;=\; \underset{n \to \infty}{\lim} \, \int_\Sigma \Phi^a(x) (\alpha_n)_a(x) \, dvol(x) \,,

where now the sum over the index $a$ is again left notationally implicit.

For handling distributions/linear off-shell observables it is therefore useful to adopt, with due care, shorthand notation as if the limits of the sequences of smooth functions $(\alpha_n)_a$ actually existed, as “generalized functions” $\alpha_a$ , and to set

\int_\Sigma \Phi^a(x) \alpha_a(x) \, dvol(x) \;\coloneqq\; A(\Phi) \,,

This suggests that basic operations on functions, such as their pointwise product, should be extended to distributions, e.g. to a product of distributions. This turns out to exist, as long as the high-frequency modes in the Fourier transform of the distributions being multiplied cancel out – the mathematical reflection of “UV-divergences” in quantum field theory. This we turn to in Free quantum fields below.

These considerations generalize from the field bundle of the real scalar field to general field bundles (def. ) as long as they are smooth vector bundles (def. ):

Definition

(Fréchet topological vector space on spaces of smooth sections of a smooth vector bundle)

Let $E \overset{fb}{\to} \Sigma$ be a field bundle (def. ) which is a smooth vector bundle (def. ) over Minkowski spacetime (def. ); hence, up to isomorphism, a trivial vector bundle as in example .

On its real vector space $\Gamma_\Sigma(E)$ of smooth sections consider the seminorms indexed by a compact subset $K \subset \Sigma$ and a natural number $k \in \mathbb{N}$ and given by

\array{ \Gamma_\Sigma(E) &\overset{p_K^k}{\longrightarrow}& [0,\infty) \\ \Phi &\mapsto& \underset{ {\vert \alpha\vert} \leq k}{max} \left( \underset{x \in K}{sup} {\vert \partial^\alpha \Phi(x)\vert}\right) \,, }

where on the right we have the absolute values of the partial derivatives of $\Phi$ index by $\alpha$ (1) with respect to any choice of norm on the fibers.

This makes $\Gamma_\Sigma(E)$ a Fréchet topological vector space.

For $K \subset \Sigma$ any closed subset then the sub-space of sections

\Gamma_{\Sigma,K}(E) \hookrightarrow \Gamma_\Sigma(E)

of sections whose support is inside $K$ becomes a Fréchet topological vector spaces with the induced subspace topology, which makes these be closed subspaces.

Finally, the vector spaces of smooth sections with prescribed causal support (def. ) are inductive limits of vector spaces $\Gamma_{\Sigma,K}(E)$ as above, and hence they inherit topological vector space structure by forming the corresponding inductive limit in the category of topological vector spaces. For instance

\Gamma_{\Sigma,cp}(E) \;\coloneqq\; \underset{\underset{ {K \subset \Sigma} \atop {K\, \text{compact}} }{\longrightarrow}}{\lim} \Gamma_{\Sigma,K}(E)

etc.

(Bär 14, 2.1)

Definition

(distributional sections)

Let $E \overset{fb}{\to} \Sigma$ be a smooth vector bundle (def. ) over Minkowski spacetime (def. ).

The vector spaces of smooth sections with restricted support from def. structures of topological vector spaces via def. . We denote the dual topological vector spaces by

\Gamma'_{\Sigma}( E ^*) \;\coloneqq\; (\Gamma_{\Sigma,cp}(E))^* \,.

This is called the space of distributional sections of the dual vector bundle ${E}^*$ .

The support of a distributional section $supp(u)$ is the set of points in $\Sigma$ such that for every neighbourhood of that point $u$ does not vanish on all sections with support in that neighbourhood.

Imposing the same restrictions to the supports of distributional sections as in def. , we have the following subspaces of distributional sections:

\Gamma'_{\Sigma,cp}(E^\ast) , \Gamma'_{\Sigma,\pm cp}(E^\ast) , \Gamma'_{\Sigma,scp}(E^\ast) , \Gamma'_{\Sigma,fcp}(E^\ast) , \Gamma'_{\Sigma,pcp}(E^\ast) , \Gamma'_{\Sigma,tcp}(E^\ast) \;\subset\; \Gamma'_{\Sigma}(E^\ast) .

(Sanders 13, Bär 14)

As before in prop. the actual smooth sections yield examples of distributional sections, and all distributional sections arise as limits of integrations against smooth sections:

Proposition

(non-singular distributional sections)

Let $E \overset{fb}{\to} \Sigma$ be a smooth vector bundle over Minkowski spacetime and let $s \in \{cp, \pm cp, scp, tcp\}$ be any of the support conditions from def. .

Then the operation of regarding a compactly supported smooth section of the dual vector bundle as a functional on sections with this support property is a dense subspace inclusion into the topological vector space of distributional sections from def. :

\array{ \Gamma_{\Sigma,cp}(E^\ast) &\overset{\phantom{A}u_{(-)}\phantom{A} }{\hookrightarrow}& \Gamma'_{\Sigma,s}(E) \\ b &\mapsto& \left( \Phi \mapsto \underset{\Sigma}{\int} b(x) \cdot \Phi(x) \, dvol_\Sigma(x) \right) }

(Bär 14, lemma 2.15)

Proposition

(distribution dualities with causally restricted supports)

Let $E \overset{fb}{\to} \Sigma$ be a smooth vector bundle (def. ) over Minkowski spacetime (def. ).

Then there are the following isomorphisms of topological vector spaces between a) dual spaces of spaces of sections with restricted causal support (def. ) and equipped with the topology from def. and b) spaces of distributional sections with restricted supports, according to def. :

\begin{aligned} \Gamma_{\Sigma,cp}(E)^* &\simeq \Gamma'_{\Sigma}(E^\ast) , \\ \Gamma_{\Sigma,+cp}(E)^* &\simeq \Gamma'_{\Sigma,fcp}(E^\ast) , \\ \Gamma_{\Sigma,-cp}(E)^* &\simeq \Gamma'_{\Sigma,pcp}(E^\ast) , \\ \Gamma_{\Sigma,scp}(E)^* &\simeq \Gamma'_{\Sigma,tcp}(E^\ast) , \\ \Gamma_{\Sigma,fcp}(E)^* &\simeq \Gamma'_{\Sigma,+cp}(E^\ast) , \\ \Gamma_{\Sigma,pcp}(E)^* &\simeq \Gamma'_{\Sigma,-cp}(E^\ast) , \\ \Gamma_{\Sigma,tcp}(E)^* &\simeq \Gamma'_{\Sigma,scp}(E^\ast) , \\ \Gamma_{\Sigma}(E)^* &\simeq \Gamma'_{\Sigma,cp}(E^\ast) . \end{aligned}

(Sanders 13, thm. 4.3, Bär 14, lem. 2.14)

The concept of linear observables naturally generalizes to that of multilinear observables:

Definition

(quadratic off-shell observables)

Let $(E,\mathbf{L})$ be a Lagrangian field theory (def. ) over a spacetime $\Sigma$ whose field bundle $E$ (def. ) is a super vector bundle.

The external tensor product of vector bundles of the field bundle $E \overset{fb}{\to} \Sigma$ with itself, denoted

E \boxtimes E \overset{}{\to} \Sigma \times \Sigma

is the vector bundle over the Cartesian product $\Sigma \times \Sigma$ , of spacetime with itself, whose fiber over a pair of points $(x_1,x_2)$ is the tensor product $E_{x_1} \otimes E_{x_2}$ of the corresponding field fibers.

Given a field history, hence a section $\phi \in \Gamma_\Sigma(E)$ of the field bundle, there is then the induced section $\phi \boxtimes \phi \in \Gamma_{\Sigma \times \Sigma}(E \boxtimes E)$ .

We say that an off-shell observable

A \;\colon\; \Gamma_\Sigma(E) \longrightarrow \mathbb{C}

is quadratic if it comes from a “graded-symmetric bilinear observable”, namely a smooth function on the space of sections of the external tensor product of the field bundle with itself

B \;\colon\; \Gamma_{\Sigma \times \Sigma}(E \boxtimes E)_{\delta_{EL}\mathbf{L} = 0} \longrightarrow \mathbb{C} \,,

A(\Phi) = B(\Phi,\Phi) \,.

More explicitly: By prop. the quadratic observable $A$ is given by a compactly supported distribution of two variables which in the notation of remark comes from a graded-symmetric matrix of generalized functions $\beta_{a_1 a_2} \in \mathcal{E}'(\Sigma \times \Sigma, E \boxtimes E)$ as

A(\Phi) \;=\; \int_{\Sigma \times \Sigma} \beta_{a_1 a_2}(x_1,x_2) \Phi^{a_1}(x_1) \cdot \Phi^{a_2}(x_2)\, dvol_\Sigma(x_1) dvol_\Sigma(x_2) \,.

This notation makes manifest how the concept of quadratic observables is a generalization of that of quadratic forms coming from bilinear forms.

Definition

(off-shell polynomial observables)

Let $(E,\mathbf{L})$ be a Lagrangian field theory (def. ) over a spacetime $\Sigma$ whose field bundle $E$ (def. ) is a super vector bundle.

An off-shell observable (def. )

A \;\colon\; \Gamma_\Sigma(E) \longrightarrow \mathbb{C}

is a polynomial observable if it is the sum of a constant, and a linear observable (def. ), and a quadratic observable (def. ) and so on:

(91)

\begin{aligned} A(\Phi) & = \phantom{+} \alpha^{(0)} \\ & \phantom{=} + \int_{\Sigma} \Phi^a(x) \alpha^{(1)}_a(x) \, dvol_\Sigma(x) \\ & \phantom{=} + \int_{\Sigma^2} \Phi^{a_1}(x_1) \cdot \Phi^{a_2}(x_2) \alpha^{(2)}_{a_1 a_2}(x_1, x_2) \, dvol_\Sigma(x_1) dvol_\Sigma(x_2) \\ & \phantom{=} + \int_{\Sigma^3} \Phi^{a_1}(x_1) \cdot \Phi^{a_2}(x_2) \cdot \Phi^{a_3}(x_3) \alpha^{(3)}_{a_1 a_2 a_3}(x_1,x_2,x_3) \, dvol_\Sigma(x_1) dvol_\Sigma(x_2) dvol_\Sigma(x^3) \\ & \phantom{=} + \cdots \,. \end{aligned}

If all the coefficient distributions $\alpha^{(k)}$ are non-singular distributions, then we say that $A$ is a regular polynomial observable.

We write

PolyObs(E)_{reg} \hookrightarrow PolyObs(E) \hookrightarrow Obs(E)

for the subspace of (regular) polynomial off-shell observables.

Example

(polynomial observables of the Dirac field)

Let $E = \Sigma \times S_{odd}$ be the field bundle of the Dirac field (example ).

Then, by prop. , an $\mathbb{R}^{0\vert 1}$ -parameterized plot of the space of off-shell polynomial observables (def. )

A_{(-)} \;\colon\; \mathbb{R}^{0 \vert 1} \longrightarrow PolyObs(\Sigma \times S_{odd})

is of the form

\begin{aligned} A_{(-)} & = a^{(0)} \\ & \phantom{=} + \theta \underset{\Sigma}{\int} a^{(1)}_{\alpha}(x) \mathbf{\Psi}^\alpha(x) dvol_\Sigma(x) \\ & \phantom{=} + \underset{\Sigma^2}{\int} a^{(2)}_{\alpha_1 \alpha_2}(x,y) \mathbf{\Psi}^{\alpha_1}(x_1) \cdot \mathbf{\Psi}^{\alpha_2}(x_2) \, dvol_\Sigma(x_1) \, dvol_\Sigma(x_2) \\ & \phantom{=} + \theta \underset{\Sigma}{\int} a^{(3)}_{\alpha_1 \alpha_2 \alpha_3}(x_1, x_2, x_3) \mathbf{\Psi}^{\alpha_1}(x_1) \cdot \mathbf{\Psi}^{\alpha_2}(x_2) \cdot \mathbf{\Psi}^{\alpha_3}(x_3) \, dvol_\Sigma(x_1) \, dvol_\Sigma(x_2) \, dvol_\Sigma(x_3) \\ & \phantom{=} + \cdots \end{aligned}

for any distributions of several variables $a^{(k)}_{\alpha_1, \cdots , \alpha_k}$ . Here

\mathbf{\Psi}^\alpha(x) \;\colon\; \Gamma_\Sigma(\Sigma \times S_{even}) \longrightarrow \mathbb{C}

are the point-evaluation field observables (example ) on the spinor bundle, and

\theta \in C^\infty(\mathbb{R}^{0\vert 1})_{odd}

is the canonical odd-graded coordinate function on the superpoint $\mathbb{R}^{0 \vert 1}$ (def. ).

Hence all the odd powers of the Dirac-field observables are proportional to $\theta$ . In particular if one considers just a point in the space of polynomial observables

A \;\colon\; \mathbb{R}^{0} \longrightarrow PolyObs(E \times S_{odd})

then all the odd monomials in the field observables of the Dirac field disappear.

Proof

By definition of supergeometric mapping spaces (def. ), there is a natural bijection between $\mathbb{R}^{0 \vert 1}$ -plots $A_{(-)}$ of the space of observables and smooth functionss out of the Cartesian product of $\mathbb{R}^{0 \vert 1}$ with the space of field histories to the complex numbers:

\frac{ \mathbb{R}^{0\vert 1} \overset{ A_{(-)} }{\longrightarrow} [ \Gamma_\Sigma(\Sigma \times S_{odd}), \mathbb{C} ] } { \mathbb{R}^{0 \vert 1} \times \Gamma_\Sigma(\Sigma \times S_{odd}) \longrightarrow \mathbb{C} }

Moreover, by prop. we have that the coordinate functions on the space of field histories of the Dirac bundle are given by the field observables $\mathbf{\Psi}^\alpha(x)$ regarded in odd degree. Now a homomorphism as above has to pull back the even coordinate function on $\mathbb{C}$ to even coordinate functions on this Cartesian product, hence to joint even powers of $\theta$ and $\mathbf{\Psi}^\alpha(x)$ .

$\,$

Next we discuss the restriction of these off-shell polynomial observables to the shell to yield on-shell polynomial observables, characterized by theorem below.

$\,$

Polynomial on-shell Observables and Distributional solutions to PDEs

The evident on-shell version of def. is this:

Definition

(on-shell polynomial observables)

Let $(E,\mathbf{L})$ be a free Lagrangian field theory (def. ) with on-shell space of field histories $\Gamma_\Sigma(E)_{\delta_{EL}\mathbf{L} = 0} \hookrightarrow \Gamma_\Sigma(E)$ . Then an on-shell observable (def. )

A \;\colon\; \Gamma_\Sigma(E) \longrightarrow \mathbb{C}

is an on-shell polynomial observable if it is the restriction of an off-shell polynomial observable $A_{off}$ according to def. :

\array{ \Gamma_{\Sigma}(E)_{\delta_{EL}\mathbf{L} = 0} &\overset{\phantom{A}A\phantom{A}}{\longrightarrow}& \mathbb{C} \\ \downarrow & \nearrow_{\mathrlap{A_{off}}} \\ \Gamma_\Sigma(E) } \,.

Similarly $A$ is an on-shell linear observable or on-shell regular polynomial observable etc. if it is the restriction of a linear observable or regular polynomial observable, respectively, according to def. . We write

PolyObs(E,\mathbf{L}) \hookrightarrow Obs(E,\mathbf{L})

for the subspace of polynomial on-shell observables inside all on-shell observables, and similarly

LinObs(E,\mathbf{L}) \hookrightarrow Obs(E,\mathbf{L})

and

PolyObs(E,\mathbf{L})_{reg} \hookrightarrow Obs(E,\mathbf{L})

etc.

While by def. every off-shell observable induces an on-shell observable simply by restriction (87), different off-shell observables may restrict to the same on-shell observable. It is therefore useful to find a condition on off-shell observables that makes them equivalent to on-shell observables under restriction.

We now discuss such precise characterizations of the off-shell polynomial observables for the case of sufficiently well behaved free field equations of motion – namely Green hyperbolic differential equations, def. below. The main result is theorem below.

While in general the equations of motion are not Green hyperbolic – namely not in the presence of implicit infinitesimal gauge symmetries discussed in Gauge symmetries below – it turns out that up to a suitable notion of equivalence they are equivalent to those that are; this we discuss in the chapter Gauge fixing below.

$\,$

Definition

(derivatives of distributions and distributional solutions of PDEs)

Given a pair of formally adjoint differential operators $P, P^\ast \colon \Gamma_\Sigma(E) \to \Gamma_\Sigma(E^\ast)$ (def. ) then the distributional derivative of a distributional section $u \in \Gamma'_\Sigma(E)$ (def. ) by $P$ is the distributional section $P u \in \Gamma'_\Sigma(E^\ast)$

P u \;\coloneqq\; u(P^\ast(-)) \;\colon\; \Gamma_{\Sigma,cp}(E^\ast) \,.

P u = 0 \;\in\; \Gamma'_\Sigma(E^\ast)

then we say that $u$ is a distributional solution (or generalized solution) of the homogeneous differential equation defined by $P$ .

Example

(ordinary PDE solutions are generalized solutions)

Let $E \overset{fb}{\to} \Sigma$ be a smooth vector bundle over Minkowski spacetime and let $P, P^\ast \colon \Gamma_\Sigma(E) \to \Gamma_\Sigma(E^\ast)$ be a pair of formally adjoint differential operators.

Then for every non-singular distributional section $u_{\Phi} \in \Gamma'_{\Sigma}(E^\ast)$ coming from an actual smooth section $\Phi \in \Gamma_\Sigma(E)$ via prop. the derivative of distributions (def. ) is the distributional section induced from the ordinary derivative of smooth functions:

P u_\Phi \;=\; u_{P \Phi} \,.

In particular $u_\Phi$ is a distributional solution to the PDE precisely if $\Phi$ is an ordinary solution:

P u_\Phi \;=\; 0 \phantom{AAA} \Leftrightarrow \phantom{AAA} P \Phi = 0 \,.

Proof

For all $b \in \Gamma_{\Sigma,cp}(E)$ we have

\begin{aligned} (P u_\Phi)(b) & = u_\Phi(P^\ast b) \\ & = \int u \cdot P^\ast b \, dvol \\ & = \int (P u) \cdot b \, dvol \\ & = u_{P \Phi}(b) \end{aligned}

where all steps are by the definitions except the third, which is by the definition of formally adjoint differential operator (def. ), using that by the compact support of $b$ and the Stokes theorem (prop. ) the term $K(\Phi,b)$ in def. does not contribute to the integral.

Definition

(advanced and retarded Green functions and causal Green function)

Let $E \overset{fb}{\to} \Sigma$ be a field bundle (def. ) which is a vector bundle (def. ) over Minkowski spacetime (def. ). Let $P \;\colon\;\Gamma_\Sigma(E) \to \Gamma_\Sigma(E^\ast)$ be a differential operator (def. ) on its space of smooth sections.

Then a linear map

\mathrm{G}_{P,\pm} \;\colon\; \Gamma_{\Sigma, cp}(E^\ast) \longrightarrow \Gamma_{\Sigma, \pm cp}(E)

from spaces of smooth sections of compact support to spaces of sections of causally sourced future/past support (def. ) is called an advanced or retarded Green function for $P$ , respectively, if

for all $\Phi \in \Gamma_{\Sigma,cp}(E_1)$ we have

(92) $G_{P,\pm} \circ P(\Phi) = \Phi$

and

(93) $P \circ G_{P,\pm}(\Phi) = \Phi$
the support of $G_{P,\pm}(\Phi)$ is in the closed future cone or closed past cone of the support of $\Phi$ , respectively.

If the advanced/retarded Green functions $G_{P\pm}$ exists, then the difference

(94)

\mathrm{G}_P \coloneqq \mathrm{G}_{P,+} - \mathrm{G}_{P,-}

is called the causal Green function.

(e.g. Bär 14, def. 3.2, cor. 3.10)

Definition

(Green hyperbolic differential equation)

Let $E \overset{fb}{\to} \Sigma$ be a field bundle (def. ) which is a vector bundle (def. ) over Minkowski spacetime (def. ).

A differential operator (def. )

P \;\colon\; \Gamma_\Sigma(E) \longrightarrow \Gamma_{\Sigma}(E^\ast)

is called a Green hyperbolic differential operator if $P$ as well as its formal adjoint differential operator $P^\ast$ (def. ) admit advanced and retarded Green functions (def. ).

(Bär 14, def. 3.2, Khavkine 14, def. 2.2)

The two archtypical examples of Green hyperbolic differential equations are the Klein-Gordon equation and the Dirac equation on Minkowski spacetime. For the moment we just cite the existence of the advanced and retarded Green functions for these, we will work these out in detail below in Propagators.

Let

P, P^\ast \;\colon\;\Gamma_\Sigma(E) \overset{}{\longrightarrow} \Gamma_\Sigma(E^\ast)

be a pair of Green hyperbolic differential operators (def. ) which are formally adjoint (def. ). Then also their causal Green functions $\mathrm{G}_P$ and $G_{P^\ast}$ (def. ) are formally adjoint differential operators, up to a sign:

\left( \mathrm{G}_P \right)^\ast \;=\; - \mathrm{G}_{P^\ast} \,.

(Khavkine 14, (24), (25))

We did not require that the advanced and retarded Green functions of a Green hyperbolic differential operator are unique; in fact this is automatic:

Proposition

(advanced and retarded Green functions of Green hyperbolic differential operator are unique)

The advanced and retarded Green functions (def. ) of a Green hyperbolic differential operator (def. ) are unique.

(Bär 14, cor. 3.12

Moreover we did not require that the advanced and retarded Green functions of a Green hyperbolic differential operator come from integral kernels (“propagators”). This, too, is automatic:

Proposition

(causal Green functions of Green hyperbolic differential operators are continuous linear maps)

Given a Green hyperbolic differential operator $P$ (def. ), the advanced, retarded and causal Green functions of $P$ (def. ) are continuous linear maps with respect to the topological vector space structure from def. and also have a unique continuous extension to the spaces of sections with larger support (def. ) as follows:

\begin{aligned} \mathrm{G}_{P,+} &\;\colon\; \Gamma_{\Sigma, pcp}(E^\ast) \longrightarrow \Gamma_{\Sigma, pcp}(E) , \\ \mathrm{G}_{P,-} &\;\colon\; \Gamma_{\Sigma, fcp}(E^\ast) \longrightarrow \Gamma_{\Sigma, fcp}(E) , \\ \mathrm{G}_{P} &\;\colon\; \Gamma_{\Sigma, tcp}(E^\ast) \longrightarrow \Gamma_{\Sigma}(E) , \end{aligned}

such that we still have the relation

\mathrm{G}_P = \mathrm{G}_{P,+} - \mathrm{G}_{P,-}

and

P \circ \mathrm{G}_{P,\pm} = \mathrm{G}_{P,\pm} \circ P = id

and

supp \mathrm{G}_{P,\pm}({\alpha}^*) \subseteq J^\pm(supp {\alpha}^*) \,.

By the Schwartz kernel theorem the continuity of $\mathrm{G}_{\pm}, \mathrm{G}$ implies that there are integral kernels

\Delta_{\pm} \;\in\; \Gamma'_{\Sigma \times \Sigma}( E \boxtimes_\Sigma E )

such that, in the notation of generalized functions,

(G_{\pm} \alpha^\ast)(x) \;=\; \underset{\Sigma}{\int} \Delta_\pm(x,y) \cdot \alpha^\ast(y) \, dvol_\Sigma(y) \,.

These integral kernels are called the advanced and retarded propagators. Similarly the combination

(95)

\Delta \;\coloneqq\; \Delta_+ - \Delta_-

is called the causal propagator.

(Bär 14, thm. 3.8, cor. 3.11)

We now come to the main theorem on polynomial observables:

Lemma

(exact sequence of Green hyperbolic differential operator)

Let $\Gamma_\Sigma(E) \overset{P}{\longrightarrow} \Gamma_\Sigma(E^\ast)$ be a Green hyperbolic differential operator (def. ) with causal Green function $\mathrm{G}$ (def. ). Then the sequences

(96)

\array{ 0 &\to& \Gamma_{\Sigma,cp}(E) &\overset{P}{\longrightarrow}& \Gamma_{\Sigma,cp}(E^\ast) &\overset{\mathrm{G}_P}{\longrightarrow}& \Gamma_{\Sigma,scp}(E) &\overset{P}{\longrightarrow}& \Gamma_{\Sigma,scp}(E^\ast) &\to& 0 \\ \\ 0 &\to& \Gamma_{\Sigma,tcp}(E) &\overset{P}{\longrightarrow}& \Gamma_{\Sigma,tcp}(E^\ast) &\overset{\mathrm{G}_P}{\longrightarrow}& \Gamma_{\Sigma}(E) &\overset{P}{\longrightarrow}& \Gamma_{\Sigma}(E^\ast) &\to& 0 }

of these operators restricted to functions with causally restricted supports as indicated (def. ) are exact sequences of topological vector spaces and continuous linear maps between them.

Under passing to dual spaces and using the isomorphisms of spaces of distributional sections (def. ) from prop. this yields the following dual exact sequence of topological vector spaces and continuous linear maps between them:

(97)

\array{ 0 &\to& \Gamma'_{\Sigma,tcp}(E) &\overset{P^*}{\longrightarrow}& \Gamma'_{\Sigma,tcp}(E^\ast) &\overset{-\mathrm{G}_{P^*}}{\longrightarrow}& \Gamma'_{\Sigma}(E) &\overset{P^*}{\longrightarrow}& \Gamma'_{\Sigma}(E^\ast) &\to& 0 \\ \\ 0 &\to& \Gamma'_{\Sigma,cp}(E) &\overset{P^*}{\longrightarrow}& \Gamma'_{\Sigma,cp}(E^\ast) &\overset{-\mathrm{G}_{P^*}}{\longrightarrow}& \Gamma'_{\Sigma,scp}(E) &\overset{P^*}{\longrightarrow}& \Gamma'_{\Sigma,scp}(E^\ast) &\to& 0 }

This is due to Igor Khavkine, based on (Khavkine 14, prop. 2.1); for proof see at Green hyperbolic differential operator this lemma.

Corollary

(on-shell space of field histories for Green hyperbolic free field theories)

Let $(E,\mathbf{L})$ be a free field theory Lagrangian field theory (def. ) whose Euler-Lagrange equation of motion $P \Phi = 0$ is Green hyperbolic (def. ).

Then the on-shell space of field histories (or of field histories with spatially compact support, def. ) is, as a vector space, linearly isomorphic to the quotient space of compactly supported sections (or of temporally compactly supported sections, def. ) by the image of the differential operator $P$ , and this isomorphism is given by the causal Green function $\mathrm{G}_P$ (94)

(98)

\array{ \Gamma_{\Sigma,tcp}(E^\ast)/im(P) &\underoverset{\simeq}{\phantom{A}\mathrm{G}_P \phantom{A}}{\longrightarrow}& ker(P) \;=\; \Gamma_{\Sigma}(E)_{\delta_{EL}\mathbf{L} = 0} \\ \Gamma_{\Sigma,cp}(E^\ast)/im(P) &\underoverset{\simeq}{\phantom{A}\mathrm{G}_P\phantom{A}}{\longrightarrow}& ker_{scp}(P) \;=\; \Gamma_{\Sigma,scp}(E)_{\delta_{EL}\mathbf{L} = 0} \,. }

Proof

This is a direct consequence of the exactness of the sequence (96) in lemma .

We spell this out for the statement for $\Gamma_{\Sigma,scp}(E)_{\delta_{EL} \mathbf{L} = 0}$ , which follows from the first line in (96), the first statement similarly follows from the second line of (96):

First the on-shell space of field histories is the kernel of $P$ , by definition of free field theory (def. )

\Gamma_{\Sigma,scp}(E)_{\delta_{EL} \mathbf{L} = 0} \;=\; ker_{scp}(P) \,.

Second, exactness of the sequence (96) at $\Gamma_{\Sigma,scp}(E)$ means that the kernel $ker_{scp}(P)$ of $P$ equals the image $im(\mathrm{G}_{P})$ . But by exactness of the sequence at $\Gamma_{\Sigma,cp}(E^\ast)$ it follows that $\mathrm{G}_P$ becomes injective on the quotient space $\Gamma_{\Sigma,cp}(E)^\ast/im(P)$ . Therefore on this quotient space it becomes an isomorphism onto its image.

Remark

Under passing to dual vector spaces, the linear isomorphism in corollary in turn yields linear isomorphisms of the form

(99)

\array{ \left(\Gamma_{\Sigma,cp}(E^\ast)/im(P)\right)^\ast &\underoverset{\simeq}{(-)\circ \mathrm{G}_P}{\longleftarrow}& \left(ker_{scp}(P)\right)^\ast \\ \left(\Gamma_\Sigma(E^\ast)/im(P)\right)^\ast &\underoverset{\simeq}{(-)\circ \mathrm{G}_P }{\longleftarrow}& \left(ker(P)\right)^\ast } \,.

Except possibly for the issue of continuity this says that the linear on-shell observables (def. ) of a Green hyperbolic free field theory are equivalently those linear off-shell observables which are generalized solutions of the formally dual equation of motion according to def. .

That this remains true also for topological vector space structure follows with the dual exact sequence (97). This is the statement of prop. below.

Proposition

(distributional sections on a Green hyperbolic solution space are the generalized PDE solutions)

Let $P, P \ast \;\colon\; \Gamma_\Sigma(E) \overset{}{\longrightarrow} \Gamma_\Sigma(E^\ast)$ be a pair of Green hyperbolic differential operators (def. ) which are formally adjoint (def. ).

Then

the canonical pairing (from prop. )

$\array{ \Gamma'_{\Sigma,cp}(E^\ast) &\otimes& \Gamma_\Sigma(E) &\overset{}{\longrightarrow}& \mathbb{C} \\ \alpha^\ast &,& \Phi &\mapsto& \int \alpha^\ast_a(x) \Phi^a(x)\, dvol_\Sigma(x) }$

induces a continuous linear isomorphism

(100) $(ker(P))^\ast \;\simeq\; \Gamma'_{\Sigma,cp}(E^\ast)/im_{cp}(P^\ast)$
a continuous linear functional on the solution space

$u_{sol} \in \left(ker(P)\right)^\ast$

is equivalently a distributional section (def. ) whose support is spacelike compact (def. , prop. )

$u \in \Gamma'_{\Sigma,scp}(E)$

and which is a distributional solution (def. ) to the differential equation

$P^\ast u = 0 \,.$

Similarly, a continuous linear functional on the subspace of solutions that have spatially compact support (def. )

$u_{sol} \in \left(ker(P)_{scp}\right)^\ast$

is equivalently a distributional section (def. ) without constraint on its distributional support

$u \in \Gamma'_{\Sigma}(E)$

and which is a distributional solution (def. ) to the differential equation

$P^\ast u = 0 \,.$

Moreover, these linear isomorphisms are both given by composition with the causal Green function $\mathrm{G}$ (def. ):

$\array{ \left(ker(P)\right)^\ast &\underoverset{\simeq}{(-)\circ \mathrm{G}}{\longrightarrow}& \left\{ u \in \Gamma'_{\Sigma,scp}(E) \,\vert\, P^\ast u = 0 \right\} \\ \left(ker_{scp}(P)\right)^\ast &\underoverset{\simeq}{(-)\circ \mathrm{G}}{\longrightarrow}& \left\{ u \in \Gamma'_{\Sigma}(E) \,\vert\, P^\ast u = 0 \right\} } \,.$

This follows from the exact sequence in lemma . For details of the proof see at Green hyperbolic differential operator this prop., due to Igor Khavkine.

In conclusion we have found the following:

Theorem

(linear observables of Green free field theory are the distributional solutions to the formally adjoint equations of motion)

Let $(E,\mathbf{L})$ be a Lagrangian free field theory (def. ) which is a free field theory (def. ) whose Euler-Lagrange differential equation of motion $P \Phi = 0$ (def. ) is Green hyperbolic (def. ), such as the Klein-Gordon equation (example ) or the Dirac equation (example ). Then:

The linear off-shell observables (def. ) are equivalently the compactly supported distributional sections (def. ) of the field bundle:

$LinObs(E) \;\simeq\; \Gamma'_{\Sigma,cp}(E)$
The linear on-shell observables (def. ) are equivalently the linear off-shell observables modulo the image of the differential operator $P$ :

(101) $LinObs(E,\mathbf{L}) \simeq LinObs(E)/im(P) \,.$

More generally the on-shell polynomial observables are identified with the off-shell polynomial observables (def. ) modulo the image of $P$ :

(102) $PolyObs(E,\mathbf{L}) \simeq PolyObs(E)/im(P) \,.$
The linear on-shell observables (def. ) are also equivalently those spacelike compactly supported compactly distributional sections (def. ) which are distributional solutions of the formally adjoint equations of motion (def. ), and this isomorphism is exhibited by precomposition with the causal propagator $\mathrm{G}$ :

$LinObs(E,\mathbf{L}) \;\underoverset{\simeq}{\phantom{A}(-)\circ\mathrm{G}_P \phantom{A}}{\longrightarrow}\; \left\{ A \in \Gamma'_{\Sigma,scp}(E) \;\vert\; P^\ast A = 0 \right\}$

Similarly the linear on-shell observables on spacelike compactly supported on-shell field histories (88) are equivalently the distributional solutions without constraint on their support:

$LinObs(E_{scp},\mathbf{L}) \;\underoverset{\simeq}{\phantom{A}(-) \circ \mathrm{G}_P \phantom{A}}{\longrightarrow}\; \left\{ A \in \Gamma'_{\Sigma}(E) \;\vert\; P^\ast A = 0 \right\}$

Proof

The first statement follows with prop. applied componentwise. The same proof applies verbatim to the subspace of solutions, showing that $LinObs(E,\mathbf{L}) \simeq \left( ker(P)\right)^\ast$ , with the dual topological vector space on the right. With this the second and third statement follows by prop. .

We will be interested in those linear observables which under the identification from theorem correspond to the non-singular distributions (because on these the Poisson-Peierls bracket of the theory is defined, theorem below):

Definition

(regular linear observables and observable-valued distributions)

Let $(E,\mathbf{L})$ be a free Lagrangian field theory (def. ) whose Euler-Lagrange equations of motion (prop. ) is Green hyperbolic (def. ).

According to def. the regular linear observables among the linear on-shell observables (def. ) are the non-singular distributions on the on-shell space of field histories, hence the image

LinObs(E_{scp},\mathbf{L})_{reg} \hookrightarrow LinObs(E_{scp},\mathbf{L})

of the map

(103)

\array{ \mathbf{\Phi} &\colon& \Gamma_{\Sigma,cp}(E^\ast) &\longrightarrow& LinObs(E_{scp},\mathbf{L}) &\hookrightarrow& Obs(E_{scp},\mathbf{L}) \\ && \alpha^\ast &\mapsto& \left( \Phi \mapsto \underset{\Sigma}{\int} \alpha^\ast_a(x) \Phi^a(x) \, dvol_\Sigma(x) \right) }

By theorem we have the identification (100) (101)

(104)

LinObs(E_{scp},\mathbf{L})_{reg} \;\simeq\; \Gamma_{\Sigma,scp}(E^\ast)/im(P) \,.

The point-evaluation field observables $\mathbf{\Phi}^a(x)$ (example ) are linear observables (example ) but far from being regular (103) (except in spacetime dimension $p +1 = 0+1$ ). But the regular observables are precisely the averages (“smearings”) of these point evaluation observables against compactly supported weights.

Viewed this way, the defining inclusion of the regular linear observables (103) is itself an observable valued distribution

(105)

\array{ \mathbf{\Phi} &\colon& \Gamma_{\Sigma,cp}(E^\ast) &\hookrightarrow& LinObs(E,\mathbf{L}) \\ && \alpha^\ast &\mapsto& \underset{\Sigma}{\int} \alpha^\ast_a(x) \mathbf{\Phi}^a(x)\, dvol_\Sigma(x) }

which to a “smearing function” $\alpha^\ast$ assigns the observable which is the field observable smeared by (i.e. averaged against) that smearing function.

Below in Free quantum fields we discuss how the polynomial Poisson algebra of regular polynomial observables of a free field theory may be deformed to a non-commutative algebra of quantum observables. Often this may be represented by linear operators acting on some Hilbert space. In this case then $\mathbf{\Phi}$ above becomes a continuous linear functional from $\Gamma_{\Sigma,cp}(E)$ to a space of linear operators on some Hilbert space. As such it is then called an operator-valued distribution.

$\,$

Local observables

We now discuss the sub-class of those observables which are “local”.

Definition

(spacetime support)

Let $E \overset{fb}{\to} \Sigma$ be a field bundle over a spacetime $\Sigma$ (def. ), with induced jet bundle $J^\infty_\Sigma(E)$

For every subset $S \subset \Sigma$ let

\array{ J^\infty_\Sigma(E)\vert_S &\overset{\iota_S}{\hookrightarrow}& J^\infty_\Sigma(E) \\ \downarrow &(pb)& \downarrow \\ S &\hookrightarrow& \Sigma }

be the corresponding restriction of the jet bundle of $E$ .

The spacetime support $supp_\Sigma(A)$ of a differential form $A \in \Omega^\bullet(J^\infty_\Sigma(E))$ on the jet bundle of $E$ is the topological closure of the maximal subset $S \subset \Sigma$ such that the restriction of $A$ to the jet bundle restrited to this subset does not vanishes:

supp_\Sigma(A) \coloneqq Cl( \{ x \in \Sigma | \iota_{\{x\}}^\ast A \neq 0 \} )

We write

\Omega^{r,s}_{\Sigma,cp}(E) \coloneqq \left\{ A \in \Omega^{r,s}_\Sigma(E) \;\vert\; supp_\Sigma(A) \, \text{is compact} \right\} \;\hookrightarrow\; \Omega^{r,s}_\Sigma(E)

for the subspace of differential forms on the jet bundle whose spacetime support is a compact subspace.

Definition

(transgression of variational differential forms to space of field histories)

Let $E \overset{fb}{\to} \Sigma$ be a field bundle over a spacetime $\Sigma$ (def. ). and let

\Sigma_r \hookrightarrow \Sigma

be a submanifold of spacetime of dimension $r \in \mathbb{N}$ . Recall the space of field histories restricted to its infinitesimal neighbourhood, denoted $\Gamma_{\Sigma_r}(E)$ (def. ).

Then the operation of transgression of variational differential forms to $\Sigma_r$ is the linear map

\tau_{\Sigma_r} \;\colon\; \Omega^{\bullet,\bullet}_{\Sigma,cp}(E) \overset{ }{\longrightarrow} \Omega^\bullet\left( \Gamma_{\Sigma_r}(E) \right)

that sends a variational differential form $A \in \Omega^{\bullet,\bullet}_{\Sigma,cp}(E)$ to the differential form $\tau_{\Sigma_r}A\in \Omega^\bullet(\Gamma_{\Sigma_r}(E))$ (def. , example ) which to a smooth family on field histories

\Phi_{(-)}(-) \;\colon\; U \times N_\Sigma \Sigma_r \longrightarrow E

assigns the differential form given by first forming the pullback of differential forms along the family of jet prolongation $j^\infty_\Sigma(\Phi_{(-)})$ followed by the integration of differential forms over $\Sigma_r$ :

(\tau_{\Sigma}A)_\Phi \;\coloneqq\; \int_{\Sigma_r} (j^\infty_\Sigma(\Phi_{(-)}))^\ast A \;\in\; \Omega^\bullet(U) \,.

Remark

(transgression to dimension $r$ picks out horizontal $r$ -forms)

In def. we regard integration of differential forms over $\Sigma_r$ as an operation defined on differential forms of all degrees, which vanishes except on forms of degree $r$ , and hence transgression of variational differential forms to $\Sigma_r$ vanishes except on the subspace

\Omega^{r,\bullet}_\Sigma(E) \;\subset\; \Omega^{\bullet,\bullet}_\Sigma(E)

of forms of horizontal degree $r$ .

Example

(adiabatically switched action functional)

Given a field bundle $E \overset{fb}{\longrightarrow} \Sigma$ , consider a local Lagrangian density (def. )

\mathbf{L} \in \Omega^{p+1,0}_\Sigma(E) \,.

For any bump function $b \in C^\infty_{cp}(\Sigma)$ , the transgression of $b \mathbf{L}$ (def. ) is called the action functional

\mathcal{S}_b \mathbf{L} \coloneqq \tau_{\Sigma} \left( b \mathbf{L} \right) \;\colon\; \Gamma_\Sigma(E) \longrightarrow \mathbb{R}

induced by $\mathbf{L}$ , “adiabatically switched” by $b$ .

Specifically if the field bundle is a trivial vector bundle as in example , such that the Lagrangian density may be written in the form

\mathbf{L} \;=\; L \left( (x^\mu), (\phi^a), (\phi^a_{,\mu}), \cdots \right) \, b dvol_\Sigma \;\in\; \Omega^{p+1,0}_{\Sigma,cp}( E ) \,.

then its action functional takes a field history $\Phi$ to the value

\mathcal{S}_{b \mathbf{L}}(\Phi) \:\colon\; \int_\Sigma L \left( x, \left( \Phi^a(x) \right), \left(\frac{\partial \Phi^a}{\partial x^\mu}(x)\right), \cdots \right) \, b(x) dvol_\Sigma(x)

Proposition

(transgression compatible with variational derivative)

Let $E \overset{fb}{\to} \Sigma$ be a field bundle over a spacetime $\Sigma$ (def. ) and let $\Sigma_r \hookrightarrow \Sigma$ be a submanifold possibly with boundary $\partial \Sigma_r \hookrightarrow \Sigma_r$ . Write

\Gamma_{\Sigma_r}(E) \overset{(-)\vert_{\partial \Sigma_r}}{\longrightarrow} \Gamma_{\partial \Sigma_r}(E)

for the boundary restriction map.

Then the operation of transgression of variational differential forms (def. )

\tau_{\Sigma} \;\colon\; \Omega^{\bullet,\bullet}_{\Sigma,cp}(E) \longrightarrow \Omega^\bullet\left(\Gamma_{\Sigma_r}(E)\right)

is compatible with the variational derivative $\delta$ and with the total spacetime derivative $d$ in the following way:

On variational forms that are in the image of the total spacetime derivative a transgressive variant of the Stokes' theorem (prop. ) holds:

$\tau_{\Sigma_r}(d \alpha) \;=\; ((-)\vert_{\partial \Sigma})^\ast \tau_{\partial \Sigma_r}( \alpha)$
Transgression intertwines, up to a sign, the variational derivative $\delta$ on variational differential forms with the plain de Rham differential on the space of field histories:

$\tau_{\Sigma}\left( \delta \alpha \right) \;=\; (-1)^{p+1}\, d \,\tau_{\Sigma}(\alpha) \,.$

Proof

Regarding the first statement, consider a horizontally exact variational form

d \alpha \in \Omega^{r,s}_{\Sigma,cp}(E) \,.

By prop. the pullback of this form along the jet prolongation of fields is exact in the $\Sigma$ -direction:

(j^\infty_\Sigma\Phi_{(-)})^\ast(d \alpha ) \;=\; d_\Sigma (j^\infty_\Sigma\Phi_{(-)})^\ast \alpha \,,

(where we write $d = d_U + d_\Sigma$ for the de Rham differential on $U \times \Sigma$ ). Hence by the ordinary Stokes' theorem (prop. ) restricted to any $\Phi_{(-)} \colon U \to \Gamma_{\Sigma_r}(E)$ with restriction $(-)\vert_{\partial \Sigma_r} \circ \Phi_{(-)} \colon U \to \Gamma_{\Sigma_r}(E)$ the relation

\begin{aligned} (\Phi_{(-)})^\ast \tau_{\Sigma_r}(d \alpha) & = \int_{\Sigma_r} d_{\Sigma_r} (j^\infty_\Sigma\Phi_{(-)})^\ast\alpha \\ & = \int_{\partial \Sigma_r} (j^\infty_\Sigma\Phi_{(-)})^\ast\alpha \\ & = \int_{\partial \Sigma_r} (j^\infty_\Sigma ( (-)\vert_{\Sigma_r} \circ \Phi_{(-)}) )^\ast\alpha \\ & = ( (-)\vert_{\Sigma_r} \circ \Phi_{(-)} )^\ast \tau_{\partial \Sigma_r}(\alpha) \\ & = (\Phi_{(-)})^\ast ((-)\vert_{\Sigma_r})^\ast \tau_{\partial \Sigma_r}(\alpha) \,. \end{aligned} \,.

Regarding the second statement: by the Leibniz rule for de Rham differential (product law of differentiation) it is sufficient to check the claim on variational derivatives of local coordinate functions

\delta \phi^a_{\mu_1 \cdots \mu_k} b \in \Omega^{0,1}_\Sigma(E) \,.

The pullback of differential forms (prop. ) along the jet prolongation $j^\infty_\Sigma(\Phi_{(-)}) \colon U \times \Sigma \to J^\infty_\Sigma(E)$ has two contributions: one from the variation along $\Sigma$ , the other from variation along $U$ :

By prop. , for fixed $u \in U$ the pullback of $\delta \phi^a_{\mu_1 \cdots \mu_k}$ along the jet prolongation vanishes.
For fixed $x \in \Sigma$ , the pullback of the full de Rham differential $\mathbf{d} \phi^a_{\mu_1\cdots \mu_k}$ is

$\begin{aligned} (\Phi_{(-)}(x))^\ast( \mathbf{d} \phi^a_{\mu_1\cdots \mu_k} ) & = d_U (\Phi_{(-)}(x))^\ast(\phi^a_{\mu_1\cdots \mu_k}) \\ & = d_U \frac{ \partial^k \Phi_{(-)}(x)}{\partial x^{\mu^1} \cdots \partial x^{\mu_k}} \end{aligned}$

(since the full de Rham differentials always commute with pullback of differential forms by prop. ), while the pullback of the horizontal derivative $d \phi^a_{\mu_1\cdots \mu_k} = \phi^a_{\mu_1 \cdots \mu_{k} \mu_{k+1}} \mathbf{d}x^{\mu_{k+1}}$ vanishes at fixed $x \in \Sigma$ .

This implies over the given smooth family $\Phi_{(-)}$ that

\begin{aligned} \tau_\Sigma\left( \delta \phi^a_{,\mu_1 \cdots \mu_k} b \right)\vert_{\Phi_{(-)}} & = \tau_\Sigma\left( \mathbf{d} ( \phi^a_{,\mu_1 \cdots \mu_k} b) \right) \vert_{\Phi_{(-)}} - \underset{ = 0 }{ \underbrace{ \tau_\Sigma \left( d (\phi^a_{,\mu_1 \cdots \mu_k} b) \right)\vert_{\Phi_{(-)}} }} \\ & = \int_\Sigma d_U (\Phi_{(-)})^\ast ( \phi^a_{\mu_1\cdots \mu_k} b ) \\ & = (-1)^{p+1} d_U \int_\Sigma (\Phi_{(-)})^\ast ( \phi^a_{\mu_1\cdots \mu_k} b ) \\ & = (-1)^{p+1} d_U \tau_{\Sigma}( \Phi_{(-)} )^\ast ( \phi^a_{\mu_1 \cdots \mu_k} ) \,. \end{aligned}

and since this holds covariantly for all smooth families $\Phi_{(-)}$ , this implies the claim.

Example

(cohomological integration by parts on the jet bundle)

Let $E \overset{fb}{\to} \Sigma$ be a field bundle (def. ).

Prop. says in particular that the operation of integration by parts in an integral is “localized” to a cohomological statement on horizontal differential forms: Let

\alpha_1, \alpha_2 \;\in\; \Omega^{\bullet,\bullet}_\Sigma(E)

be two variational differential forms (def. ), of total horizontal degree $p$ (hence one less than the dimension of spacetime $\Sigma$ ).

Then the derivation-property of the total spacetime derivative says that

(106)

(d \alpha_1) \wedge \alpha_2 \;=\; - (-1)^{deg(\alpha_1)} \alpha_1 \wedge ( d \alpha_2 ) \;\; d( \alpha_1 \wedge \alpha_2 ) \;\in\; \Omega^{p+1,\bullet}_\Sigma(E) \,,

hence that we may “throw over” the spacetime derivative from the factor $\alpha_1$ to the factor $\alpha_2$ , up to a sign, and up to a total spacetime derivative $d (\alpha_1 \wedge \alpha_2)$ . By prop. this last term vanishes under transgression $\tau_\sigma$ to a spacetime without manifold with boundary, so that the above equation becomes

\tau_\Sigma( d \alpha_1) \wedge \alpha_2 ) \;=\; - (-1)^{deg(\alpha_1)} \tau_\Sigma( \alpha_1 \wedge d \alpha_2 ) \,,

hence

\underset{\Sigma}{\int} (d j^\infty_\sigma(\alpha_1)) \wedge j^\infty_\Sigma(\alpha_2) \;=\; - (-1)^{deg(\alpha_1)} \underset{\Sigma}{\int} j^\infty_\Sigma(\alpha_1) \wedge d j^\infty_\Sigma(\alpha_2) \,.

This last statement is the statement of integration by parts under an integral.

Notice that these integrals (and hence the actual integration by parts-rule) only exist if $\alpha_1 \wedge \alpha_2$ has compact spacetime support, while the “cohomological” avatar (106) of this relation on the jet bundle holds without such a restriction.

Example

(variation of the action functional)

Given a Lagrangian field theory $(E,\mathbf{L})$ (def. ) then the derivative of its adiabatically switched action functional (def. ) equals the transgression of the Euler-Lagrange variational derivative $\delta_{EL} \mathbf{L}$ (def. ):

d \mathcal{S}_{b \mathbf{L}} \;=\; \tau_\Sigma( b \delta_{EL}\mathbf{L} ) \,.

Proof

By the second statement of prop. we have

\begin{aligned} d \mathcal{S}_{b \mathbf{L}} & = \tau_\Sigma( \delta ( b \mathbf{L} ) ) \end{aligned} \,,

Moreover, by prop. this is

\begin{aligned} \cdots & = \tau_\Sigma( \delta_{EL} b \mathbf{L} + d \Theta_{BFV,b} ) \\ & = \tau_\Sigma( \delta_{EL} b \mathbf{L} ) + \underset{= 0}{\underbrace{\tau_\Sigma( d \Theta_{BFV,b} )}} \end{aligned} \,,

where the second term vanishes by the first statement of prop. .

Proposition

(principle of extremal action)

Let $(E,\mathbf{L})$ be a Lagrangian field theory (def. ).

The de Rham differential $d \mathcal{S}_{b\mathbf{L}}$ of the action functional (example ) vanishes at a field history

\Phi \in \Gamma_\Sigma(E)

for all adiabatic switchings $b \in C^\infty_{cp}(\Sigma)$ constant on some subset $\mathcal{O} \subset \Sigma$ (def. ) on those smooth collections of field histories

\Phi_{(-)} \;\colon\; U \longrightarrow \Gamma_\Sigma(E)

around $\Phi$ which, as functions on $U$ , are constant outside $\mathcal{O}$ (example , example ) precisely if $\Phi$ solves the Euler-Lagrange equations of motion (def. ):

\left( \underset{ { {\mathcal{O} \subset \Sigma} \atop { b\vert_{\mathcal{O}} = const } } \atop { \Phi_{(-)}\vert_{\Sigma \setminus \mathcal{O}} = const } }{\forall} \left( (\Phi_{(-)})^\ast d \mathcal{S}_{b \mathbf{L}}(\Phi) = 0 \right) \right) \;\Leftrightarrow\; \left( j^\infty_\Sigma(\Phi)^\ast \left( \frac{\delta_{EL} L}{\delta \phi^a} \right) = 0 \right) \,.

Proof

By prop. we have

(\Phi_{(-)})^\ast d \mathcal{S}_{b \mathbf{L}} \;=\; \int_\Sigma j^\infty_\Sigma(\Phi_{(-)})^\ast ( \delta_{EL} b \mathbf{L} ) \,.

By the assumption on $\Phi_{(-)}$ it follows that after pullback to $U$ the switching function $b$ is constant, so that it commutes with the differentials:

(\Phi_{(-)})^\ast d \mathcal{S}_{b \mathbf{L}} \;=\; \int_\Sigma b j^\infty_\Sigma(\Phi_{(-)})^\ast ( \delta_{EL} \mathbf{L} ) \,.

This vanishes at $\Phi$ for all $\Phi_{(-)}$ precisely if all components of $j^\infty_\Sigma(\Phi_{(-)})^\ast ( \delta_{EL} \mathbf{L} )$ vanish, which is the statement of the Euler-Lagrange equations of motion.

Definition

(local observables)

Given a Lagrangian field theory $(E,\mathbf{L})$ (def. ) the local observables are the horizontal p+1-forms

of compact spacetime support (def. )
modulo total spacetime derivatives

LocObs(E) \;\coloneqq\; \left(\Omega^{p+1,0}_{\Sigma,cp}(E)/(im(d))\right)\vert_{\mathcal{E}^\infty}

which we may identify with the subspace of all observables (86) that arises as the image under transgression of variational differential forms $\tau_\Sigma$ (def. ) of local observables to functionals on the on-shell space of field histories (67):

\array{ LocObs(E) &\overset{\tau_\Sigma}{\hookrightarrow}& Obs(E) \\ \alpha &\maptos& \tau_\Sigma \alpha } \,.

This is a sub-vector space inside all observables which is however not closed under the pointwise product of observables (89) (unless $E=0$ ). We write

MultiLocObs(E) \hookrightarrow Obs(E)

for the smallest subalgebra of observables, under the pointwise product (89), that contains all the local observables. This is called the algebra of multilocal observables.

The intersection of the (multi-)local observables with the off-shell polynomial observables (def. ) are the (multi-)local polynomial observables

(107)

PolyLocObs(E) \hookrightarrow PolyMultiLocObs(E) \overset{\text{dense}}{\hookrightarrow} PolyObs(E) \hookrightarrow Obs(E)

Example

(local observables of the real scalar field)

Consider the field bundle of the real scalar field (example ).

A typical example of local observables (def. ) in this case is the “field amplitude averaged over a given spacetime region” determined by a bump function $b \in C^\infty_{cp}(\Sigma)$ . On an on-shell field history $\Phi$ this observable takes as value the integral

\tau_\Sigma(b \phi)(\Phi) \;=\; \int_\Sigma \Phi(x) b(x) dvol_\Sigma(x) \,.

Example

(local observables of the electromagnetic field)

Consider the field bundle for free electromagnetism on Minkowski spacetime $\Sigma$ .

Then for $b \in C^\infty(\Sigma)$ a bump function on spacetime, the transgression of the universal Faraday tensor (def. ) against $b$ times the volume form is a local observable (def. ), namely the field strength (20) of the electromagnetic field averaged over spacetime.

For the construction of the algebra of quantum observables it will be important to notice that the intersection between local observables and regular polynomial observables is very small:

Example

(local regular polynomial observables are linear observables)

An observable (def. ) which is

a regular polynomial observable (def. );
a local observable (def. )

is necessarily

a linear observable (def. ).

This is because non-linear local expressions are polynomials in the sense of def. with delta distribution-coefficients, for instance for the real scalar field the $\Phi^2$ interaction term is

\int (\Phi(x))^2 \, dvol_\Sigma(x) \;=\; \int \int \Phi(x) \Phi(y) \underset{ = \alpha^{(2)}(x,y) }{\underbrace{\delta(x-y)}} \, dvol_\Sigma(y)

and so its coefficient $\alpha^{(2)}$ is manifestly not a non-singular distribution.

$\,$

Infinitesimal observables

The definition of observables in def. and specifically of local observables in def. uses explicit restriction to the shell, hence, by the principle of extremal action (prop. ) to the “critical locus” of the action functional. Such critical loci are often hard to handle explicitly. It helps to consider a “homological resolution” that is given, in good circumstances, by the corresponding “derived critical locus”. These we consider in detail below in Reduced phase space. In order to have good control over these resolutions, we here consider the first perturbative aspect of field theory, namely we consider the restriction of local observables to just an infinitesimal neighbourhood of a background on-shell field history:

Definition

(local observables around infinitesimal neighbourhood of background on-shell field history)

Let $(E,\mathbf{L})$ be a Lagrangian field theory (def. ) whose field bundle $E$ is a trivial vector bundle (example ) and whose Lagrangian density $\mathbf{L}$ is spacetime-independent (example ). Let $\Sigma \times \{\varphi\} \hookrightarrow \mathcal{E}$ be a constant section of the shell (59) as in example .

Then we write

LocObs_\Sigma(E,\varphi)

for the restriction of the local observables (def. ) to the fiberwise infinitesimal neighbourhood (example ) of $\Sigma \times \{\varphi\}$ .

Explicitly, this means the following:

First of all, by prop. the dependence of the Lagrangian density $\mathbf{L}$ on the order of field derivatives is bounded by some $k \in \mathbb{N}$ on some neighbourhood of $\varphi$ and hence, by the spacetime independence of $\mathbf{L}$ , on some neighbourhood of $\Sigma \times \{\varphi\}$ .

Therefore we may restrict without loss to the order- $k$ jets. By slight abuse of notation we still write

\mathcal{E} \hookrightarrow J^k_\Sigma(E)

for the corresponding shell. It follows then that the restriction of the ring $\Omega^{0,0}_{\Sigma,cp}(E)$ of smooth functions on the jet bundle to the infinitesimal neighbourhood (example ) is equivalently the formal power series ring over $C^\infty_{cp}(\Sigma)$ in the variables

((\phi^a- \varphi^a), (\phi^a_{,\mu}- \varphi^a_{,\mu}), \cdots, (\phi^a_{,\mu_1 \cdots \mu_k} - \varphi^a_{,\mu_1 \cdots \mu_k}) )

We denote this by

(108)

\Omega^{0,0}_{\Sigma,cp}(E,\varphi) \;\coloneqq\; C^\infty_{cp}(\Sigma)\left[ \left[ (\phi^a - \varphi^a ), (\phi^a_{,\mu} -\varphi^a_{,\mu}), \cdots, (\phi^a_{,\mu_1 \cdots \mu_k}- \varphi^a_{,\mu_1 \cdots \mu_k}) \right] \right] \,.

A key consequence is that the further restriction of this ring to the shell $\mathcal{E}^\infty$ (52) is now simply the further quotient ring by the ideal generated by the total spacetime derivatives of the components $\frac{\partial_{EL}L}{\delta \phi^a}$ of the Euler-Lagrange form (prop. ).

(109)

\begin{aligned} \Omega^{0,0}_{\Sigma,cp}(E,\varphi)\vert_{\mathcal{E}} & \coloneqq \Omega^{0,0}_{\Sigma,cp}(E,\varphi) / \left( \frac{d^k}{ d x^{\mu_1} \cdots d x^{\mu_l}} \frac{\delta_{EL} L}{\delta \phi^a} \right)_{ { a \in \{1, \cdots, s\} } \atop { { l \in \{1, \cdots, k\} } \atop { \mu_r \in \{0, \cdots, p\} } } } \\ & = C^\infty_{cp}(\Sigma)\left[ \left[ (\phi^a - \varphi^a ), (\phi^a_{,\mu} -\varphi^a_{,\mu}), \cdots, (\phi^a_{,\mu_1 \cdots \mu_k}- \varphi^a_{,\mu_1 \cdots \mu_k}) \right] \right] / \left( \frac{d^k}{ d x^{\mu_1} \cdots d x^{\mu_l}} \frac{\delta_{EL} L}{\delta \phi^a} \right)_{ { a \in \{1, \cdots, s\} } \atop { { l \in \{1, \cdots, k\} } \atop { \mu_r \in \{0, \cdots, p\} } } } \end{aligned} \,.

Finally the local observables restricted to the infinitesimal neighbourhood is the module

(110)

LocObs_\Sigma(E,\varphi) \;\simeq\; \left( \Omega^{0,0}_{\Sigma,cp}(E,\varphi)\vert_{\mathcal{E}} \langle dvol_\Sigma \rangle \right)/(im(d)) \,.

The space of local observables in def. is the quotient of a formal power series algebra by the components of the Euler-Lagrange form and by the image of the horizontal spacetime de Rham differential. It is convenient to also conceive of the components of the Euler-Lagrange form as the image of a differential, for then the algebra of local observables obtaines a cohomological interpretation, which will lend itself to computation. This differential, whose image is the components of the Euler-Lagrange form, is called the BV-differential. We introduce this now first (def. below) in a direct ad-hoc way. Further below we discuss the conceptual nature of this differential as part of the construction of the reduced phase space as a derived critical locus (example below).

Definition

(local BV-complex of ordinary Lagrangian density)

In correspondence with def. , write

\Gamma_{\Sigma,cp}(T_\Sigma J^\infty_\Sigma E,\varphi) \simeq \Gamma_{\Sigma,cp}(J^\infty_\Sigma T_\Sigma E,\varphi) \;\in\; \Omega^{0,0}_{\Sigma,cp}(E) Mod

for the restriction of vertical vector fields on the jet bundle to the fiberwise infinitesimal neighbourhood (example ) of $\Sigma \times {\varphi}$ .

Now we regard this as a graded module over $\Omega^{0,0}_{\Sigma,cp}(E,\varphi)$ (108) concentrated in degree $-1$ :

\Gamma_{\Sigma,cp}(J^\infty_\Sigma T_\Sigma E,\varphi)[-1] \;\in\; \Omega^{0,0}_{\Sigma,cp}(E) Mod^{\mathbb{Z}} \,.

This is called the module of antifields corresponding the given type of fields encoded by $E$ .

If the field bundle is a trivial vector bundle (example ) with field coordinates $(\phi^a)$ , then we write

(111)

\phi^{\ddagger}_{a,\mu_1 \cdots \mu_l} \;\coloneqq\; \left( \partial_{(\phi^a_{\mu_1 \cdots \mu_l})} \right)[-1] \;\in\; \Gamma_{\Sigma,cp}(T_\Sigma J^\infty_\Sigma E,\varphi)[-1]

for the vector field generator that takes derivatives along $\partial_{\phi^a_{,\mu_1 \cdots \mu_k}}$ , but regarded now in degree -1.

Evaluation of vector fields in thelocal BV-complex total spacetime derivatives $\frac{d^l}{d x^{\mu_1} \cdots d x^{\mu_l}} \delta_{EL}\mathbf{L} \in \Omega^{p,0}_\Sigma(E) \wedge \delta \Omega^{0,0}_\Sigma(E)$ of the variational derivative (prop. ) yields a linear map over $\Omega^{\bullet,\bullet}_{\Sigma,cp}(E,\varphi)$ (109)

\iota_{(-)}\delta_{EL} \mathbf{L} \;\colon\; \Gamma_{\Sigma,cp}( J^\infty_\Sigma T_\Sigma E,\varphi)[-1] \longrightarrow \Omega^{p+1,0}_{\Sigma,cp}(E,\varphi) \,.

If we use the volume form $dvol_\Sigma$ on spacetime $\Sigma$ to induce an identification

\Omega^{p+1,0}_\Sigma(E) \;\simeq\; C^\infty(J^\infty_\Sigma(E))\langle dvol_\sigma\rangle

with respect to which the Lagrangian density decomposes as

\mathbf{L} = L dvol_\Sigma

then this is a $\Omega^{0,0}_\sigma(E,\varphi)$ -linear map of the form

\iota_{(-)}{\delta L_{EL}} \;\colon\; \Gamma_{\Sigma,cp}^{ev}(T_\Sigma E,\varphi)[-1] \longrightarrow \Omega^{0,0}_{\Sigma,cp}(E,\varphi) \,.

In the special case that the field bundle $E \overset{fb}{\to} \Sigma$ is a trivial vector bundle (example ) with field coordinates $(\phi^a)$ so that the Euler-Lagrange form has the coordinate expansion

\delta_{EL} \mathbf{L} \;=\; \frac{\delta_{EL}\mathbf{L}}{\delta \phi^a} \delta \phi^a

then this map is given on the antifield basis elements (111) by

\iota_{(-)} {\delta L_{EL}} \;\colon\; \phi^{\ddagger}_{a,\mu_1 \cdots \mu_l} \;\mapsto\; \frac{d^l}{d x^{\mu_1} \cdots d x^{\mu_l}} \frac{\delta_{EL} L}{\delta \phi^a} \,.

Consider then the graded symmetric algebra

C^\infty( J^\infty_\Sigma((T_\Sigma E)[-1] \times_\Sigma E, \varphi) ) \;\coloneqq\; Sym_{\Omega^{0,0}_{\Sigma,cp}(E,\varphi)}\left( \Gamma_{\Sigma,cp}(J^\infty_\Sigma T_\Sigma E,\varphi)[-1] \right)

which is generated over $\Omega^{0,0}_{\Sigma,cp}(E,\varphi)$ from the module of vector fields in degree -1.

If we think of a single vector field as a fiber-wise linear function on the cotangent bundle, and of a multivector field similarly as a multilinear function on the cotangent bundle, then we may think of this as the algebra of functions on the infinitesimal neighbourhood (example ) of $\varphi$ inside the graded manifold $(T_\Sigma E)[-1] \times_\Sigma E$ .

Let now

(112)

s_{BV} \;\colon\; C^\infty( J^\infty_\Sigma((T_\Sigma E)[-1] \times_\Sigma E, \varphi) ) \;\longrightarrow\; C^\infty( J^\infty_\Sigma((T_\Sigma E)[-1] \times_\Sigma E, \varphi) )

be the unique extension of the linear map $\iota_{(-)}{\delta_{EL} L}$ to an $\mathbb{R}$ -linear derivation of degree +1 on this algebra.

The resulting differential graded-commutative algebra over $\mathbb{R}$

\Omega^{0,0}_{\Sigma,cp}(E,\varphi)\vert_{\mathcal{E}_{BV}} \;\coloneqq\; \left( C^\infty( J^\infty_\Sigma((T_\Sigma E)[-1] \times_\Sigma E, \varphi) ) \,,\, s_{BV} \right)

is called the local BV-complex of the Lagrangian field theory at the background solution $\varphi$ . This is the CE-algebra of the infintiesimal neighbourhood of $\Sigma \times \{\varphi\}$ in the derived prolonged shell (def. ). In this case, in the absence of any explicit infinitesimal gauge symmetries, this is an example of a Koszul complex.

There are canonical homomorphisms of dgc-algebras, one from the algebra of functions $\Omega^{0,0}_{\Sigma,cp}(E,\varphi)$ on the infinitesimal neighbourhood of the background solution $\varphi$ to the local BV-complex and from there to the local observables on the neighbourhood of the background solution $\varphi$ (109), all considered with compact spacetime support:

\Omega^{0,0}_{\Sigma,cp}(E,\varphi) \longrightarrow \Omega^{0,0}_{\Sigma,cp}(E,\varphi)\vert_{\mathcal{E}_{BV}} \longrightarrow \Omega^{0,0}_{\Sigma,cp}(E,\varphi)\vert_{\mathcal{E}}

such that the composite is the canonical quotient coprojection.

Similarly we obtain a factorization for the entire variational bicomplex:

(113)

\Omega^{\bullet,\bullet}_\Sigma(E,\varphi) \longrightarrow \Omega^{\bullet,\bullet}_\Sigma(E,\varphi)\vert_{\mathcal{E}_{BV}} \longrightarrow \Omega^{\bullet,\bullet}_\Sigma(E,\varphi)\vert_{\mathcal{E}} \,,

where $\Omega^{\bullet,\bullet}_\Sigma(E,\varphi)\vert_{\mathcal{E}_{BV}}$ is now triply graded, with three anti-commuting differentials $d$ $\delta$ and $s_{BV}$ .

By construction this is now such that the local observables (def. ) are the cochain cohomology of this complex in horizontal form degree p+1, vertical degree 0 and BV-degree 0:

LocObs_\Sigma(E) \simeq \Omega^{p+1,0}_{\Sigma,cp}(E)/(im(s_{BV} + d)) \,.

$\,$

States

We introduce the basics of quantum probability in terms of states defined as positive linear maps on star-algebras of observables.

Definition

(star algebra)

A star ring is a ring $R$ equipped with

a linear map

$(-)^\ast \;\colon\; R \longrightarrow R$

such that

(involution) $((-)^\ast)^\ast = id$ ;
(antihomomorphism)
1. $(a b)^\ast = b^\ast a^\ast$ for all $a,b \in R$
2. $1^\ast = 1$ .

A homomorphism of star-rings

f \;\colon\; (R_1, (-)^\ast) \longrightarrow (R_2, (-)^\dagger)

is a homomorphism of the underlying rings

f \;\colon\; R_1 \longrightarrow R_2

which respects the star-involutions in that

f \circ (-)^\ast \;=\; (-)^\dagger \circ f \,.

A star algebra $\mathcal{A}$ over a commutative star-ring $R$ in an associative algebra $\mathcal{A}$ over $R$ such that the inclusion

R \hookrightarrow \mathcal{A}

is a star-homomorphism.

Examples

(complex number-valued observables are star-algebra under pointwise product and pointwise complex conjugation)

The complex numbers $\mathbb{C}$ carry the structure of a star-ring (def. ) with star-operation given by complex conjugation.

Given any space $X$ , then the algebra of functions on $X$ with values in the complex numbers carries the structure of a star-algebra over the star-ring $\mathbb{C}$ (def. ) with star-operation given by pointwise complex conjugation in the complex numbers.

In particular for $(E,\mathbf{L})$ a Lagrangian field theory (def. ) then its on-shell observables $Obs(E,\mathbf{L})$ (def. ) carry the structure of a star-algebra this way.

Definition

(state on a star-algebra)

Given a star algebra $(\mathcal{A}, (-)^\ast)$ (def. ) over the star-ring of complex numbers (def. ) a state is a function to the complex numbers

\langle -\rangle \;\colon\; Obs_\Sigma \longrightarrow \mathbb{C}

such that

(linearity) this is a complex-linear map:

$\left\langle c_1 A_1 + c_2 A_2 \right\rangle \;=\; c_1 \langle A_1 \rangle + c_2 \langle A_2 \rangle$
(positivity) for all $A \in Obs$ we have that

$\langle A^\ast A \rangle \geq 0 \;\in\; \mathbb{R}$

where on the left $A^\ast$ is the star-operation from
(normalization)

$\langle 1 \rangle \;=\; 1 \,.$

(e.g. Bordemann-Waldmann 96, Fredenhagen-Rejzner 12, def. 2.4, Khavkine-Moretti 15, def. 6)

Remark

(probability theoretic interpretation of state on a star-algebra)

A star algebra $\mathcal{A}$ (def. ) equipped with a state $\mathcal{A} \overset{\langle -\rangle}{\longrightarrow} \mathbb{C}$ (def. ) is also called a quantum probability space, at least when $\mathcal{A}$ is in fact a von Neumann algebra.

For this interpretation we think of each element $A \in \mathcal{A}$ as an observable as in example and of the state as assigning expectation values.

Remark

(states form a convex set)

For $\mathcal{A}$ a unital star-algebra (def. ), the set of states on $\mathcal{A}$ according to def. is naturally a convex set: For $\langle (-)\rangle_1, \langle - \rangle_2 \colon \mathcal{A} \to \mathbb{C}$ two states then for every $p \in [0,1] \subset \mathbb{R}$ also the linear combination

\array{ \mathcal{A} &\overset{p \langle (-)\rangle_1 + (1-p) \langle (-)\rangle_2}{\longrightarrow}& \mathbb{C} \\ A &\mapsto& p \langle A \rangle_1 + (1-p) \langle A \rangle_2 }

is a state.

Definition

(pure state)

A state $\rho \colon \mathcal{A} \to \mathbb{C}$ on a unital star-algebra (def. ) is called a pure state if it is extremal in the convex set of all states (remark ) in that an identification

\langle (- )\rangle = p \langle (-)\rangle_1 + (1-p) \langle (-)\rangle_2

for $p \in (0,1)$ implies that $\langle (-) \rangle_1 = \langle (-)\rangle_2$ (hence $= \langle (-)\rangle$ ).

Proposition

(classical probability measure as state on measurable functions)

For $\Omega$ classical probability space, hence a measure space which normalized total measure $\int_\Omega d\mu = 1$ , let $\mathcal{A} \cloneqq L^1(\Omega)$ be the algebra of Lebesgue measurable functions with values in the complex numbers, regarded as a star algebra (def. ) by pointwise complex conjugation as in example . Then forming the expectation value with respect to $\mu$ defines a state (def. ):

\array{ L^1(\Omega) &\overset{\langle (-)\rangle_\mu}{\longrightarrow}& \mathbb{C} \\ A &\mapsto& \int_\Omega A d\mu }

Example

(elements of a Hilbert space as pure states on bounded operators)

Let $\mathcal{H}$ be a complex separable Hilbert space with inner product $\langle -,-\rangle$ and let $\mathcal{A} \coloneqq \mathcal{B}(\mathcal{H})$ be the algebra of bounded operators, regarded as a star algebra (def. ) under forming adjoint operators. Then for every element $\psi \in \mathcal{H}$ of unit norm $\langle \psi,\psi\rangle = 1$ there is the state (def. ) given by

\array{ \mathcal{B}(\mathcal{H}) &\overset{\langle (-)\rangle_\psi}{\longrightarrow}& \mathbb{C} \\ A &\mapsto& \langle \psi \vert\, A \, \vert \psi \rangle &\coloneqq& \langle \psi, A \psi \rangle }

These are pure states (def. ).

More general states in this case are given by density matrices.

Theorem

(GNS construction)

Given

a star-algebra, $\mathcal{A}$ (def. );
a state, $\langle (-)\rangle \;\colon\; \mathcal{A} \to \mathbb{C}$ (def. )

there exists

a star-representation

$\pi \;\colon\; \mathcal{A} \longrightarrow End(\mathcal{H})$

of $\mathcal{A}$ on some Hilbert space $\mathcal{H}$
a cyclic vector $\psi \in \mathcal{H}$

such that $\langle (-)\rangle$ is the state corresponding to $\psi$ via example , in that

\begin{aligned} \langle A \rangle & = \langle \psi \vert\, A \, \vert \psi \rangle \\ & \coloneqq \langle \psi , \pi(A) \psi \rangle \end{aligned}

for all $A \in \mathcal{A}$ .

(Khavkine-Moretti 15, theorem 1)

Definition

(classical state)

Given a Lagrangian field theory $(E,\mathbf{L})$ (def. ) then a classical state is a state on the star algebra (def. ) of on-shell observables (example ):

\langle -\rangle \;\colon\; Obs(E,\mathbf{L}) \longrightarrow \mathbb{C} \,.

Below we consider quantum states. These are defined just as in def. , only that now the algebra of observables is equipped with another product, which changes the meaning of the product expression $A^\ast A$ and hence the positivity condition in def. .

$\,$

This concludes our discussion of observables. In the next chapter we consider the construction of the covariant phase space and of the Poisson-Peierls bracket on observables.

Phase space

In this chapter we discuss these topics:

Covariant phase space
BV-Resolution of the covariant phase space
Hamiltonian local observables

$\,$

It might seem that with the construction of the local observables (def. ) on the on-shell space of field histories (prop. ) the field theory defined by a Lagrangian density (def. ) has been completely analyzed: This data specifies, in principle, which field histories are realized, and which observable properties these have.

In particular, if the Euler-Lagrange equations of motion (def. ) admit Cauchy surfaces (def. below), i.e. spatial codimension 1 slices of spacetimes such that a field history is uniquely specified already by its restriction to the infinitesimal neighbourhood of that spatial slice, then a sufficiently complete collection of local observables whose spacetime support (def. ) covers that Cauchy surface allows to predict the evolution of the field histories through time from that Cauchy surface.

This is all what one might think a theory of physical fields should accomplish, and in fact this is essentially all that was thought to be required of a theory of nature from about Isaac Newton‘s time to about Max Planck’s time.

But we have seen that a remarkable aspect of Lagrangian field theory is that the de Rham differential of the local Lagrangian density $\mathbf{L}$ (def. ) decomposes into two kinds of variational differential forms (prop. ), one of which is the Euler-Lagrange form which determines the equations of motion (50).

However, there is a second contribution: The presymplectic current $\Omega_{BFV} \in \Omega^{p,2}_{\Sigma}(E)$ (55). Since this is of horizontal degree $p$ , its transgression (def. ) implies a further structure on the space of field histories restricted to spacetime submanifolds of dimension $p$ (i.e. of spacetime “codimension 1”). There may be such submanifolds such that this restriction to their infinitesimal neighbourhood (example ) does not actually change the on-shell space of field histories, these are called the Cauchy surfaces (def. below).

By the Hamiltonian Noether theorem (prop. ) the presymplectic current induces infinitesimal symmetries acting on field histories and local observables, given by the local Poisson bracket (prop. ). The transgression (def. ) of the presymplectic current to these Cauchy surfaces yields the corresponding infinitesimal symmetry group acting on the on-shell field histories, whose Lie bracket is the Poisson bracket pairing on on-shell observables (example below). This data, the on-shell space of field histories on the infinitesimal neighbourhood of a Cauchy surface equipped with infinitesimal symmetry exhibited by the Poisson bracket is called the phase space of the theory (def. ) below.

In fact if enough Cauchy surfaces exist, then the presymplectic forms associated with any one choice turn out do agree after pullback to the full on-shell space of field histories, exhibiting this as the covariant phase space of the theory (prop. below) which is hence manifestly independent of aa choice of space/time splitting. Accordingly, also the Poisson bracket on on-shell observables exists in a covariant form; for free field theories with Green hyperbolic equations of motion (def. ) this is called the Peierls-Poisson bracket (theorem below). The integral kernel for this Peierls-Poisson bracket is called the causal propagator (prop. ). Its “normal ordered” or “positive frequency component”, called the Wightman propagator (def. below) as well as the corresponding time-ordered variant, called the Feynman propagator (def. below), which we discuss in detail in Propagators below, control the causal perturbation theory for constructing perturbative quantum field theory by deforming the commutative pointwise product of on-shell observables to a non-commutative product governed to first order by the Peierls-Poisson bracket.

To see how such a deformation quantization comes about conceptually from the phase space strucure, notice from the basic principles of homotopy theory that given any structure on a space which is invariant with respect to a symmetry group acting on the space (here: the presymplectic current) then the true structure at hand is the homotopy quotient of that space by that symmetry group. We will explain this further below. This here just to point out that the homotopy quotient of the phase space by the infinitesimal symmetries of the presymplectic current is called the symplectic groupoid and that the true algebra of observables is hence the (polarized) convolution algebra of functions on this groupoid. This turns out to the “algebra of quantum observables” and the passage from the naive local observables on presymplectic phase space to this non-commutative algebra of functions on its homotopy quotient to the symplectic groupoid is called quantization. This we discuss in much detail below; for the moment this is just to motivate why the covariant phase space is the crucial construction to be extracted from a Lagrangian field theory.

$\,$

\array{ \left\{ \array{ \text{on-shell space} \\ \text{ of field histories} \\ \text{restricted to} \\ \text{Cauchy surface} } \right\} &\overset{\array{ \text{homotopy} \\ \text{quotient} \\ \text{by} \\ \text{infinitesimal} \\ \text{symmetries} }}{\longrightarrow} & \left\{ \array{ \text{covariant} \\ \text{phase space} } \right\} &\overset{ \array{\text{Lie algebra} \\ \text{of functions} } }{\longrightarrow}& \left\{ \array{ \text{Poisson algebra} \\ \text{of observables} } \right\} \\ & \searrow & \Big\downarrow{}^\mathrlap{{\text{Lie integration}}} && {}^{\mathllap{quantization}}\Big\downarrow \\ && \left\{ \array{ \text{symplectic} \\ \text{groupoid} } \right\} & \overset{ \array{ \text{polarized} \\ \text{convolution} \\ \text{algebra} } }{\longrightarrow}& \left\{ \array{ \text{quantum algebra} \\ \text{of observables} } \right\} }

$\,$

Covariant phase space

Definition

(Cauchy surface)

Given a Lagrangian field theory $(E, \mathbf{L})$ on a spacetime $\Sigma$ (def. ), then a Cauchy surface is a submanifold $\Sigma_p \hookrightarrow \Sigma$ (def. ) such that the restriction map from the on-shell space of field histories $\Gamma_\Sigma(E)_{\delta_{EL}\mathbf{L} = 0}$ (67) to the space $\Gamma_{\Sigma_p}(E)_{\delta_{EL}\mathbf{L} = 0}$ (68) of on-shell field histories restricted to the infinitesimal neighbourhood of $\Sigma_p$ (example ) is an isomorphism:

(114)

\Gamma_\Sigma(E)_{\delta_{EL} \mathbf{L} = 0 } \underoverset{\simeq}{(-)\vert_{N_\Sigma \Sigma_p}}{\longrightarrow} \Gamma_{\Sigma_p}(E)_{\delta_{EL}\mathbf{L} = 0} \,.

Example

(normally hyperbolic differential operators have Cauchy surfaces)

Given a Lagrangian field theory $(E, \mathbf{L})$ on a spacetime $\Sigma$ (def. ) whose equations of motion (def. ) are given by a normally hyperbolic differential operator (def. ), then it admits Cauchy surfaces in the sense of Def. .

(e.g. Bär-Ginoux-Pfäffle 07, section 3.2)

Definition

(phase space associated with a Cauchy surface)

Given a Lagrangian field theory $(E, \mathbf{L})$ on a spacetime $\Sigma$ (def. ) and given a Cauchy surface $\Sigma_p \hookrightarrow \Sigma$ (def. ) then the corresponding phase space is

the super smooth set $\Gamma_{\Sigma_p}(E)_{\delta_{EL}\mathbf{L} = 0}$ (68) of on-shell field histories restricted to the infinitesimal neighbourhood of $\Sigma_p$ ;
equipped with the differential 2-form (as in def. )

(115) $\omega_{\Sigma_p} \;\coloneqq\; \tau_{\Sigma_p}\left(\Omega_{BFV}\right) \;\in\; \Omega^2\left( \Gamma_{\Sigma_p}(E)_{\delta_{EL}\mathbf{L} = 0} \right)$

which is the distributional transgression (def. ) of the presymplectic current $\Omega_{BFV}$ (def. ) to $\Sigma_p$ .

This $\omega_{\Sigma_p}$ is a closed differential form in the sense of def. , due to prop. and using that $\Omega_{BFV} = \delta \Theta_{BFV}$ is closed by definition (55). As such this is called the presymplectic form on the phase space.

Example

(evaluation of transgressed variational form on tangent vectors for free field theory)

Let $(E,\mathbf{L})$ be a Lagrangian field theory (def. ) which is free (def. ) hence whose field bundle is a some smooth super vector bundle (example ) and whose Euler-Lagrange equation of motion is linear. Then the synthetic tangent bundle (def. ) of the on-shell space of field histories $\Gamma_{\Sigma}(E)_{\delta_{EL}\mathbf{L} = 0}$ (67) with spacelike compact support (def ) is canonically identified with the Cartesian product of this super smooth set with itself

T\left( \Gamma_{\Sigma,scp}(E)_{\delta_{EL} \mathbf{L} = 0} \right) \;\simeq\; \left(\Gamma_{\Sigma,scp}(E)_{\delta_{EL} \mathbf{L} = 0}\right) \times \left(\Gamma_{\Sigma,scp}(E)_{\delta_{EL} \mathbf{L} = 0}\right) \,.

With field coordinates as in example , we may expand the presymplectic current as

\Omega_{BFV} = \left(\Omega_{BFV}\right)^{\mu_1, \cdots, \mu_{k_1}, \nu_1, \cdots, \nu_{k_2}, \kappa}_{a_1 a_2} \delta \phi^{a_1}_{\mu_1 \cdots \mu_k} \wedge \delta \phi^{a_2}_{\nu_1 \cdots \nu_{k_2}} \wedge \iota_{\partial_\kappa} dvol_\Sigma \,,

where the components $(\Omega_{BFV})_{a_1 a_2}^{\mu_1, \cdots, \mu_{k_1}, \nu_1, \cdots, \nu_{k_2}, \kappa}$ are smooth functions on the jet bundle.

Under these identifications the value of the presymplectic form $\omega_{\Sigma_p}$ (115) on two tangent vectors $\vec \Phi_1, \vec \Phi_2 \in \Gamma_{\Sigma,scp}(E)$ at a point $\Phi \in \Gamma_{\Sigma,scp}(E)$ is

\omega_{\Sigma_p}(\vec \Phi_1, \vec \Phi_2) \;=\; \underset{\Sigma_p}{\int} \left(\Omega_{BFV}\right)^{\mu_1, \cdots, \mu_{k_1}, \nu_1, \cdots, \nu_{k_2}, \kappa}_{a_1 a_2}(\Phi(x)) \left( \frac{\partial}{\partial x^{\mu_1}} \cdots \frac{\partial}{\partial x^{\mu_{k_1}}} \vec \Phi_1(x) \right) \left( \frac{\partial}{\partial x^{\nu_1}} \cdots \frac{\partial}{\partial x^{\nu_{k_2}}} \vec \Phi_2(x) \right) \, \iota_{\partial_\kappa} dvol_\Sigma(x) \,.

Example

(presymplectic form for free real scalar field)

Consider the Lagrangian field theory for the free real scalar field from example .

Under the identification of example the presymplectic form on the phase space (def. ) associated with a Cauchy surface $\Sigma_p \hookrightarrow \Sigma$ is given by

\begin{aligned} \omega_{\Sigma_p}(\vec \Phi_1, \vec\Phi_2) & = \int_{\Sigma_{p}} \left( \frac{\partial \vec \Phi_1}{\partial x^\mu}(x) \vec \Phi_2(x) - \vec \Phi_1(x) \frac{\partial \vec \Phi_2}{\partial x^\mu}(x) \right) \eta^{\mu \nu} \iota_{\partial_\mu} dvol_{\Sigma_{p}}(x) \\ & = \underset{\Sigma_p}{\int} K(\vec \Phi_1, \vec \Phi_2) \,. \end{aligned}

Here the first equation follows via example from the form of $\Omega_{BFV}$ from example , while the second equation identifies the integrand as the witness $K$ for the formally self-adjointness of the Klein-Gordon equation from example .

Example

(presymplectic form for free Dirac field)

Consider the Lagrangian field theory of the free Dirac field (example ).

Under the identification of example the presymplectic form on the phase space (def. ) associated with a Cauchy surface $\Sigma_p \hookrightarrow \Sigma$ is given by

\begin{aligned} \omega_{\Sigma_p}(\theta_1 \vec \Psi_1, \theta_2 \vec\Psi_2) & = \int_{\Sigma_{p}} \left( \overline{\theta_1 \vec \psi_1}\gamma^\mu \left( \theta_2 \vec \Psi_2 \right) \right) \iota_{\partial_\mu} dvol_{\Sigma_{p}}(x) \\ & = \underset{\Sigma_p}{\int} K(\vec \Phi_1, \vec \Phi_2) \,. \end{aligned}

Proposition

(covariant phase space)

Consider $(E, \mathbf{L})$ a Lagrangian field theory on a spacetime $\Sigma$ (def. ).

Let

\Sigma_{tra} \overset{tra}{\hookrightarrow} \Sigma

be a submanifold with two boundary components $\partial \Sigma_{tra} = \Sigma_{in} \sqcup \Sigma_{out}$ , both of which are Cauchy surfaces (def. ).

Then the corresponding inclusion diagram

\array{ && \Sigma_{tra} \\ & {}^{\mathllap{in}}\nearrow && \nwarrow^{\mathrm{out}} \\ \Sigma_{in} && && \Sigma_{out} }

induces a Lagrangian correspondence between the associated phase spaces (def. )

\array{ && \Gamma_{\Sigma_{tra}}(E)_{\delta_{EL} \mathbf{L} = 0} \\ & {}^{\mathllap{ (-)\vert_{in} }}\swarrow && \searrow^{\mathrlap{ (-)\vert_{out} }} \\ \Gamma_{\Sigma^{(in)}}(E)_{\delta_{EL}\mathbf{L}= 0} && && \Gamma_{\Sigma^{(out)}}(E)_{\delta_{EL}\mathbf{L}= 0} \\ & {}_{\mathllap{\omega_{in}}}\searrow && \swarrow_{\mathrlap{\omega_{out}}} \\ && \mathbf{\Omega}^{2} }

in that the pullback of the two presymplectic forms (115) coincides on the space of field histories:

\left( (-)\vert_{in}\right)^\ast\left( \omega_{in}\right) \;=\; \left( (-)\vert_{out} \right)^\ast \left( \omega_{out} \right) \phantom{AAAA} \in \Omega^2 \left( \Gamma_{\Sigma_{tra}}(E)_{\delta_{EL} \mathbf{L} = 0} \right) \,.

Hence there is a well defined presymplectic form

\omega \in \Omega^2\left( \Gamma_\Sigma(E)_{\delta_{EL}\mathbf{L}} = 0 \right)

on the genuine space of field histories, given by $\omega \coloneqq i^\ast \omega_{\Sigma_p}$ for any Cauchy surface $\Sigma_p \overset{i}{\hookrightarrow} \Sigma$ . This presymplectic smooth space

\left( \Gamma_\Sigma(E)_{\delta_{EL}\mathbf{L}} \,,\, \omega \right)

is therefore called the covariant phase space of the Lagrangian field theory $(E,\mathbf{L})$ .

Proof

By prop. the total spacetime derivative $d \Omega_{BFV}$ of the presymplectic current vanishes on-shell:

d \Omega_{BFV} = - \delta \delta_{EL} \mathbf{L}

in that the pullback (def. ) along the shell inclusion $\mathcal{E} \overset{i_{\mathcal{E}}}{\hookrightarrow} J^\infty_\Sigma(E)$ (51) vanishes:

\begin{aligned} (i_{\mathcal{E}})^\ast \left( d \Omega_{BFV} \right) & = - (i_{\mathcal{E}})^\ast \left( \delta \delta_{EL} \mathcal{L} \right) \\ & = - \delta \underset{ = 0 }{ \underbrace{ (i_{\mathcal{E}})^\ast \left( \delta_{EL} \mathbf{L} \right) } } \\ & = 0 \end{aligned}

This implies that the transgression of $d \Omega_{BFV}$ to the on-shell space of field histories $\Gamma_{\Sigma_{tra}}(E)_{\delta_{EL}\mathbf{L} = 0}$ vanishes (since by definition (65) that involves pulling back through the shell inclusion)

\tau_{\Sigma_{tra}}(d \Omega_{BFV}) = 0 \,.

But then the claim follows with prop. :

\begin{aligned} 0 & = \tau_{\Sigma_{tra}}(d \Omega_{BFV}) \\ & = ((-)\vert_{\Sigma_{tra}})^\ast \tau_{\partial \Sigma_{tra}} \Omega_{BFV} \,. \end{aligned}

Theorem

(polynomial Poisson bracket on covariant phase space – the Peierls bracket)

Let $(E,\mathbf{L})$ be a Lagrangian field theory (def. ) such that

it is a free field theory (def. )
whose Euler-Lagrange equation of motion $P \Phi = 0$ (def. ) is
1. formally self-adjoint or formally anti self-adjoint (def. ) such that
  - the integral over the witness $K$ (33) is the
  presymplectic form (115): $\omega_{\Sigma_p} = \underset{\Sigma_p}{\int} K$ ;
2. Green hyperbolic (def. ).

Write

\mathrm{G}_P \;\colon\; LinObs(E_{scp},\mathbf{L})^{reg} \overset{\mathrm{G}_P}{\longrightarrow} \Gamma_{\Sigma,scp}(E)_{\delta_{EL}\mathbf{L} = 0}

for the linear map from regular linear field observables (def. ) to on-shell field histories with spatially compact support (def. ) given under the identification (104) by the causal Green function $\mathrm{G}_P$ (def. ).

Then for every Cauchy surface $\Sigma_p \hookrightarrow \Sigma$ (def. ) this map is an inverse to the presymplectic form $\omega_{\Sigma_p}$ (def. ) in that, under the identification of tangent vectors to field histories from example , we have that the composite

(116)

\array{ \omega_{\Sigma_p}(\mathrm{G}_P(-),(-)) \;=\; ev &\colon& LinObs(E_{scp},\mathbf{L})^{reg} &\otimes& \Gamma_{\Sigma,scp}(E) &\longrightarrow& \mathbb{C} \\ && (A &,& \Phi) &\mapsto& A(\Phi) }

equals the evaluation map of observables on field histories.

This means that for every Cauchy surface $\Sigma_p$ the presymplectic form $\omega_{\Sigma_p}$ restricts to a symplectic form on regular linear observables. The corresponding Poisson bracket is

\left\{ -,- \right\}_{\Sigma_p} \;\coloneqq\; \omega_{\Sigma_p}(\mathrm{G}_P(-), \mathrm{G}_P(-)) \;\;\colon\;\; LinObs(E_{scp},\mathbf{L})^{reg} \otimes LinObs(E_{scp},\mathbf{L})^{reg} \longrightarrow \mathbb{R} \,.

Moreover, equation (116) implies that this is the covariant Poisson bracket in the sense of the covariant phase space (def. ) in that it does not actually depend on the choice of Cauchy surface.

An equivalent expression for the Poisson bracket that makes its independence from the choice of Cauchy surface manifest is the $P$ -Peierls bracket given by

(117)

\array{ LinObs(E_{scp},\mathbf{L})^{reg} \otimes LinObs(E_{scp},\mathbf{L})^{reg} &\overset{\{-,-\}}{\longrightarrow}& \mathbb{R} \\ (\alpha^\ast, \beta^\ast) &\mapsto& \underset{\Sigma}{\int} \mathrm{G}(\alpha^\ast) \cdot \beta^\ast \, dvol_\Sigma }

where on the left $\alpha^\ast, \beta^\ast \in \Gamma_{\Sigma,cp}(E^\ast) \simeq LinObs(E_{scp},\mathbf{L})^{reg}$

Hence under the given assumptions, for every Cauchy surface the Poisson bracket associated with that Cauchy surface equals the invariantly (“covariantly”) defined Peierls bracket

\{-,-\}_{\Sigma_p} = \{-,-\} \,.

Finally this means that in terms of the causal propagator $\Delta$ (95) the covariant Peierls-Poisson bracket is given in generalized function-notation by

(118)

\{\alpha^\ast, \beta^\ast\} \;=\; \underset{\Sigma}{\int} \underset{\Sigma}{\int} \alpha^\ast(x) \cdot \Delta(x,y) \cdot \beta^\ast(y) \, dvol_\Sigma(x)\, dvol_\Sigma(y)

Therefore, while the point-evaluation field observables $\mathbf{\Phi}^a(x)$ (def. ) are not themselves regular observables (def. ), the Peierls-Poisson bracket (118) is induced from the following distributional bracket between them

\left\{ \mathbf{\Phi}^a(x) , \mathbf{\Phi}^b(y) \right\} \;=\; \Delta^{a b}(x,y)

with the causal propagator (95) on the right, in that with the identification (105) the Peierls-Poisson bracket on regular linear observables arises as follows:

\begin{aligned} \left\{ \underset{\Sigma}{\int} \alpha^\ast_a(x) \mathbf{\Phi}^a(x) \, dvol_\Sigma(x) \,,\, \underset{\Sigma}{\int} \beta^\ast_b(y) \mathbf{\Phi}^b(y) \, dvol_\Sigma(y) \right\} & = \underset{\Sigma}{\int} \underset{\Sigma}{\int} \alpha^\ast_a(x) \underset{= \Delta^{a b}(x,y)}{ \underbrace{ \left\{ \mathbf{\Phi}^a(x), \mathbf{\Phi}^b(y) \right\} } } \beta^\ast_b(y) \, dvol_\Sigma(x)\, dvol_\Sigma(y) \\ & = \underset{\Sigma}{\int} \underset{\Sigma}{\int} \alpha^\ast_a(x) \Delta^{a b}(x,y) \beta^\ast_b(y) \, dvol_\Sigma(x)\, dvol_\Sigma(y) \end{aligned}

(Khavkine 14, lemma 2.5)

Proof

Consider two more Cauchy surfaces $\Sigma_p^\pm \hookrightarrow I^\pm(\Sigma) \hookrightarrow \Sigma$ , in the future $I^+$ and in the past $I^-$ of $\Sigma$ , respectively. Choose a partition of unity on $\Sigma$ consisting of two elements $\chi^\pm \in C^\infty(\Sigma)$ with support bounded by these Cauchy surfaces: $supp(\chi_\pm) \subset I^\pm(\Sigma^{\mp})$ .

Then define

(119)

P_\chi \;\colon\; \Gamma_{\Sigma,scp}(E) \longrightarrow \Gamma_{\Sigma,cp}(E^\ast)

(120)

\begin{aligned} P_\chi(\Phi) & \coloneqq \phantom{-} P(\chi_+ \Phi) \\ & = - P(\chi_- \Phi) \,. \end{aligned}

Notice that the support of the partitioned field history is in the compactly sourced future/past cone

(121)

\chi_\pm \Phi \;\in\; \Gamma_{\Sigma,\pm cp}(E)

since $\Phi$ is supported in the compactly sourced causal cone, but that $P(\chi_\pm \Phi)$ indeed has compact support as required by (119): Since $P(\Phi) = 0$ , by assumption, the support is the intersection of that of $\Phi$ with that of $d \chi_\pm$ , and the first is spacelike compact by assumption, while the latter is timelike compact, by definition of partition of unity.

Similarly, the equality in (120) holds because by partition of unity $P(\chi_+ \Phi) + P(\chi_-\Phi) = P((\chi_+ + \chi_-)\Phi ) = P(\Phi) = 0$ .

It follows that

(122)

\begin{aligned} \mathrm{G}_P \circ P_\chi (\Phi) & = \left( \mathrm{G}_{P,+} - \mathrm{G}_{P,-} \right) P_\chi (\Phi) \\ & = \underset{ = \chi_+ \Phi}{\underbrace{\mathrm{G}_{P,+} P(\chi_+ \Phi)}} + \underset{ = \chi_- \Phi }{\underbrace{\mathrm{G}_{P,-} P(\chi_- \Phi)}} \\ & = (\chi_+ + \chi_-)\Phi \\ & = \Phi \,, \end{aligned}

where in the second line we chose from the two equivalent expressions (120) such that via (121) the defining property of the advanced or retarded Green function, respectively, may be applied, as shown under the braces.

(Khavkine 14, lemma 2.1)

Now we apply this to the computation of $\omega_{\Sigma_p}(\mathrm{G}_P(-),-)$ :

\begin{aligned} \omega_{\Sigma_P}(\mathrm{G}_P(\alpha^\ast),\vec \Phi) & = \underset{\Sigma_P}{\int} K(\mathrm{G}_P(\alpha^\ast), \vec \Phi) \\ & = \underset{\Sigma_P}{\int} K(\mathrm{G}_P(\alpha^\ast), \chi_+\vec \Phi) + \underset{\Sigma_P}{\int} K(\mathrm{G}_P(\alpha^\ast), \chi_-\vec \Phi) \\ & = \underset{I^-(\Sigma_P)}{\int} d K(\mathrm{G}_P(\alpha^\ast), \chi_+\vec \Phi) - \underset{I^+(\Sigma_P)}{\int} d K(\mathrm{G}_P(\alpha^\ast), \chi_-\vec \Phi) \\ & = \underset{I^-(\Sigma_P)}{\int} \left( \underset{= 0}{ \underbrace{ P(\mathrm{G}_P(\alpha^\ast))}} \cdot \chi_+\vec \Phi \mp \mathrm{G}_P(\alpha^\ast) \cdot P(\chi_+ \vec \Phi) \right) dvol_\Sigma - \underset{I^+(\Sigma_P)}{\int} \left( \underset{= 0}{ \underbrace{ P(\mathrm{G}_P(\alpha^\ast))}} \cdot \chi_-\vec \Phi \mp \mathrm{G}_P(\alpha^\ast) \cdot P(\chi_- \vec \Phi) \right) dvol_\Sigma \\ & = \mp \left( \underset{I^-(\Sigma_P)}{\int} \mathrm{G}_P(\alpha^\ast) \cdot P(\chi_+ \vec \Phi) dvol_\Sigma + \underset{I^+(\Sigma_P)}{\int} \mathrm{G}_P(\alpha^\ast) \cdot P(\chi_+ \vec \Phi) dvol_\Sigma \right) \\ & = \underset{\Sigma}{\int} \mathrm{G}_P(\alpha^\ast) \cdot P(\chi_+ \vec \Phi) dvol_\Sigma \\ & = \underset{\Sigma}{\int} \alpha^\ast \cdot \mathrm{G}_{P} (P (\chi_+ \vec \Phi)) \\ & = \underset{\Sigma}{\int} \alpha^\ast \cdot \vec \Phi \end{aligned}

Here we computed as follows:

applied the assumption that $\omega_{\Sigma_p}(-,-) = \underset{\Sigma_p}{\int} K(-,-)$ ;
applied the above partition of unity;
used the Stokes theorem (prop. ) for the past and the future of $\Sigma_p$ , respectively;
applied the definition of $d K$ as the witness of the formal (anti-) self-adjointness of $P$ (def. );
used $P\circ \mathrm{G}_p = 0$ on $\Gamma_{\Sigma,cp}(E^\ast)$ (def. ) and used (120);
unified the two integration domains, now that the integrands are the same;
used the formally (anti-)self adjointness of the Green functions (example );
used (122).

Example

(scalar field and Dirac field have covariant Peierls-Poisson bracket)

Examples of free Lagrangian field theories for which the assumptions of theorem are satisfied, so that the covariant Poisson bracket exists in the form of the Peierls bracket include

the free real scalar field (example );
the free Dirac field (example ).

For the free scalar field this is the statement of example with example , while for the Dirac field this is the statement of example with example .

For the free electromagnetic field (example ) the assumptions of theorem are violated, the covariant phase space does not exist. But in the discussion of Gauge fixing, below, we will find that for an equivalent re-incarnation of the electromagnetic field, they are met after all.

$\,$

BV-resolution of the covariant phase space

So far we have discussed the covariant phase space (prop. ) in terms of explicit restriction to the shell. We now turn to the more flexible perspective where a homological resolution of the shell in terms of “antifields” is used (def. ).

Example

(BV-presymplectic current)

Then in the BV-variational bicomplex (113) there exists the BV-presymplectic potential

(123)

\Theta_{BV} \;\coloneqq\; \phi^{\ddagger}_a \delta \phi^a \, dvol_\Sigma \;\in\; \Omega^{p,1}_\Sigma(E,\varphi)\vert_{\mathcal{E}_{BV}}

and the corresponding BV-presymplectic current

\Omega_{BV} ;\in\; \Omega^{p,2}_\Sigma(E,\varphi)\vert_{\mathcal{E}_{BV}}

defined by

\begin{aligned} \Omega_{BV} & \coloneqq \delta \Theta_{BV} \\ & = \delta \phi^{\ddagger}_a \wedge \delta \phi^a \wedge dvol_{\Sigma} \end{aligned} \,,

where $(\phi^a)$ are the given field coordinates, $\phi^{\ddagger}_a$ the corresponding antifield coordinates (111) and $\frac{\delta_{EL} \mathbf{L}}{\delta \phi^a}$ the corresponding components of the Euler-Lagrange form (prop. ).

Proposition

(local BV-BFV relation)

Then the BV-presymplectic current $\Omega_{BV}$ (def. ) witnesses the on-shell vanishing (prop. ) of the total spacetime derivative of the genuine presymplectic current $\Omega_{BFV}$ (prop. ) in that the total spacetime derivative of $\Omega_{BFV}$ equals the BV-differential $s_{BV}$ of $\Omega_{BV}$ :

d \Omega_{BFV} = s \Omega_{BV} \,.

Hence if $\Sigma_{tra} \hookrightarrow \Sigma$ is a submanifold of spacetime of full dimension $p+1$ with boundary $\partial \Sigma_{tra} = \Sigma_{in} \sqcup \Sigma_{out}$

\array{ && \Sigma_{tra} \\ & {}^{\mathllap{in}}\nearrow && \nwarrow^{\mathrm{out}} \\ \Sigma_{in} && && \Sigma_{out} }

then the pullback of the two presymplectic forms (115) on the incoming and outgoing spaces of field histories, respectively, differ by the BV-differential of the transgression of the BV-presymplectic current:

\left( (-)\vert_{in}\right)^\ast\left( \omega_{in}\right) \;-\; \left( (-)\vert_{out} \right)^\ast \left( \omega_{out} \right) = \tau_{\mathbb{D} \times \Sigma_{tra}} ( s \Omega_{BV} ) \phantom{AAAA} \in \Omega^2 \left( \Gamma_{\Sigma_{tra}}(E)_{\delta_{EL} \mathbf{L} = 0} \right) \,.

This homological resolution of the Lagrangian correspondence that exhibits the “covariance” of the covariant phase space (prop. ) is known as the BV-BFV relation (Cattaneo-Mnev-Reshetikhin 12 (9)).

Proof

For the first statement we compute as follows:

\begin{aligned} s \Omega_{BV} & = - \delta (s \phi^{\ddagger}_a) \delta \phi^a \wedge dvol_{\Sigma} \\ & = - \delta \frac{\delta_{EL}L }{\delta \phi^a} \delta \phi^a dvol_{\Sigma} \\ & = - \delta \delta_{EL}\mathbf{L} \\ & = d \Omega_{BFV} \,, \end{aligned}

where the first steps simply unwind the definitions, and where the last step is prop. .

With this the second statement follows by immediate generalization of the proof of prop. .

Example

(derived presymplectic current of real scalar field)

Consider a Lagrangian field theory (def. ) without any non-trivial implicit infinitesimal gauge transformations (def. ); for instance the real scalar field from example .

Inside its local BV-complex (def. ) we may form the linear combination of

the presymplectic current $\Omega_{BFV}$ (example )
the BF-presymplectic current $\Omega_{BV}$ (example ).

This yields a vertical 2-form

\Omega \;\coloneqq\; \Omega_{BV} + \Omega_{BFV} \;\; \in \Omega^{p,2}_\Sigma(E)\vert_{\mathcal{E}_{BV}}

which might be called the derived presymplectic current.

Similarly we may form the linear combination of 1. the presymplectic potential current $\Theta_{BFV}$ (49)

the BF-presymplectic potential current $\Theta_{BV}$ (123)
the Lagrangian density $\mathbf{L}$ (def. )

hence

\Theta \;\coloneqq\; \Theta_{BV} + \underset{Lepage}{\underbrace{ \Theta_{BFV} + \mathbf{L} }}

(where the sum of the two terms on the right is the Lepage form (56)). This might be called the derived presymplectic potental current.

We then have that

(\delta + (d-s))\Omega \;=\; 0

and in fact

(\delta + (d-s))\Theta \;=\; \Omega \,.

Proof

Of course the first statement follows from the second, but in fact the two contributions of the first statement even vanish separately:

\delta \Omega = 0 \,, \phantom{AAAA} (d-s)\Omega = 0 \,.

The statement on the left is immediate from the definitions, since $\Omega = \delta \Theta$ . For the statement on the right we compute

\begin{aligned} (d - s) (\Omega_{BV} + \Omega_{BFV}) & = \underset{= 0}{\underbrace{d \Omega_{BFV} - \underset{ = 0 }{\underbrace{ s \Omega_{BV}}} }} + \underset{ = 0}{\underbrace{ d \Omega_{BV} - s \Omega_{BFV} }} \\ & = 0 \end{aligned}

Here the first term vanishes via the local BV-BFV relation (prop. ) while the other two terms vanish simply by degree reasons.

Similarly for the second statement we compute as follows:

\begin{aligned} (\delta + (d - s) ) \Theta & = \underset{ = \Omega_{BV} + \Omega_{BFV}}{\underbrace{ \delta (\Theta_{BV} + \Theta_{BFV}) }} + \underset{ = \delta \mathbf{L}}{\underbrace{\mathbf{d} \mathbf{L}}} + \underset{ = 0 }{\underbrace{ (d-s) \mathbf{L} }} + (d-s)(\Theta_{BV} + \Theta_{BFV}) \\ & = \Omega_{BV} + \Omega_{BFV} + \delta \mathbf{L} + \underset{ = 0}{\underbrace{d \Theta_{BV}}} - \underset{ = \delta_{EL} \mathbf{L} }{\underbrace{ s \Theta_{BV}}} + \underset{ = \delta_{EL}\mathbf{L} - \delta \mathbf{L} }{\underbrace{ d \Theta_{BFV} } } - \underset{ = 0 }{\underbrace{ s \Theta_{BFV} }} \\ & = \Omega_{BV} + \Omega_{BFV} \end{aligned} \,.

Here the direct vanishing of various terms is again by simple degree reasons, and otherwise we used the definition of $\Omega$ and, crucially, the variational identity $\delta \mathbf{L} = \delta_{EL}\mathbf{L} - d \Theta_{BFV}$ (49).

$\,$

Hamiltonian local observables

We have defined the local observables (def. ) as the transgressions of horizontal $p+1$ -forms (with compact spacetime support) to the on-shell space of field histories $\Gamma_{\Sigma}(E)_{\delta_{EL}\mathbf{L} = 0}$ over all of spacetime $\Sigma$ . More explicitly, these could be called the spacetime local observables.

But with every choice of Cauchy surface $\Sigma_p \hookrightarrow \Sigma$ (def. ) comes another notion of local observables: those that are transgressions of horizontal $p$ -forms (instead of $p+1$ -forms) to the on-shell space of field histories restricted to the infinitesimal neighbourhood of that Cauchy surface (def. ): $\Gamma_{\Sigma_p}(E)_{\delta_{EL} \mathbf{L} = 0}$ . These are spatially local observables, with respect to the given choice of Cauchy surface.

Among these spatially local observables are the Hamiltonian local observables (def. below) which are transgressions specifically of the Hamiltonian forms (def. ). These inherit a transgression of the local Poisson bracket (prop. ) to a Poisson bracket on Hamiltonian local observables (def. below). This is known as the Peierls bracket (example below).

Definition

(Hamiltonian local observables)

Let $(E, \mathbf{L})$ be a Lagrangian field theory (def. ).

Consider a local observable (def. )

\tau_\Sigma(A) \;\colon\; \Gamma_\Sigma(E)_{\delta_{EL}\mathbf{L} = 0} \longrightarrow \mathbb{C} \,,

hence the transgression of a variational horizontal $p+1$ -form $A \in \Omega^{p+1,0}_{\Sigma,cp}(E)$ of compact spacetime support.

Given a Cauchy surface $\Sigma_p \hookrightarrow \Sigma$ (def. ) we say that $\tau_\Sigma (A)$ is Hamiltonian if it is also the transgression of a Hamiltonian differential form (def. ), hence if there exists

(H,v) \in \Omega^{p,0}_{\Sigma, Ham}(E)

whose transgression over the Cauchy surface $\Sigma_p$ equals the transgression of $A$ over all of spacetime $\Sigma$ , under the isomorphism (114)

\array{ \Gamma_\Sigma(E)_{\delta_{EL} \mathbf{L} = 0 } && \underoverset{\simeq}{(-)\vert_{N_\Sigma \Sigma_p}}{\longrightarrow} && \Gamma_{\Sigma_p}(E)_{\delta_{EL}\mathbf{L} = 0} \\ & {}_{\mathllap{\tau_\Sigma}(A)}\searrow && \swarrow_{\mathrlap{ \tau_{\Sigma_p}(H) }} \\ && \mathbf{\Omega}^2 }

Beware that the local observable $\tau_{\Sigma_p}(H)$ defined by a Hamiltonian differential form $H \in \Omega^{p,0}_{\Sigma,Ham}(E)$ as in def. does in general depend not just on the choice of $H$ , but also on the choice $\Sigma_p$ of the Cauchy surface. The exception are those Hamiltonian forms which are conserved currents:

Proposition

(conserved charges – transgression of conserved currents)

Let $(E,\mathbf{L})$ be a Lagrangian field theory (def. ).

If a Hamiltonian differential form $J \in \Omega^{p,0}_{\Sigma,Ham}(E)$ (def. ) happens to be a conserved current (def. ) in that its total spacetime derivative vanishes on-shell

d J \vert_{\mathcal{E}} \;= \; 0

then the induced Hamiltonian local observable $\tau_{\Sigma_p}(J)$ (def. ) is independent of the choice of Cauchy surface $\Sigma_p$ (def ) in that for $\Sigma_p, \Sigma'_p \hookrightarrow \Sigma$ any two Cauchy surfaces which are cobordant, then

\tau_{\Sigma_p}(J) = \tau_{\Sigma'_p}(J) \,.

The resulting constant is called the conserved charge of the conserved current, traditionally denoted

Q \;\coloneqq\; \tau_{\Sigma_p}(J) \,.

Proof

By definition the transgression of $d J$ vanishes on the on-shell space of field histories. Therefore the result is given by Stokes' theorem (prop. ).

Definition

(Poisson bracket of Hamiltonian local observables on covariant phase space)

Let $(E, \mathbf{L})$ be a Lagrangian field theory (def. ) where the field bundle $E \overset{fb}{\to} \Sigma$ is a trivial vector bundle over Minkowski spacetime (example ).

We say that the Poisson bracket on Hamiltonian local observables (def. ) is the transgression (def. ) of the local Poisson bracket (def. ) of the corresponding Hamiltonian differential forms (def. ) to the covariant phase space (def. ).

Explicitly: for $\Sigma_p \hookrightarrow \Sigma$ a choice of Cauchy surface (def. ) then the Poisson bracket between two local Hamiltonian observables $\tau_{\Sigma_p}((H_i, v_i))$ is

(124)

\left\{ \tau_{\Sigma_p}((H_1, v_1)) \,,\, \tau_{\Sigma_p}( (H_2, v_2) ) \right\} \;\coloneqq\; \tau_{\Sigma_p}( \, \{ (H_1, v_1), (H_2, v_2) \} \, ) \,,

where on the right we have the transgression of the local Poisson bracket $\{(H_1, v_1), (H_2, v_2)\}$ of Hamiltonian differential forms on the jet bundle from prop. .

Proof

We need to see that equation (124) is well defined, in that it does not depend on the choice of Hamiltonian form $(H_i, v_i)$ representing the local Hamiltonian observable $\tau_{\Sigma_p}(H_i)$ .

It is clear that all the transgressions involved depend only on the restriction of the Hamiltonian forms to the pullback of the jet bundle to the infinitesimal neighbourhood $N_\Sigma \Sigma_p$ . Moreover, the Poisson bracket on the jet bundle (84) clearly respects this restriction.

If a Hamiltonian differential form $H$ is in the kernel of the transgression map relative to $\Sigma_p$ , in that for every smooth collection $\Phi_{(-)} \colon U \to \Gamma_{\Sigma_p}(E)_{\delta_{EL}\mathbf{L} = 0}$ of field histories (according to def. ) we have (by def. )

\int_{\Sigma_p} j^\infty_\Sigma(\Phi_{(-)})^\ast H \;= \;0 \;\;\; \in \Omega^p(U)

then the fact that the kernel of integration is the exact differential forms says that $j^\infty_\Sigma(\Phi_{(-)})^\ast H \in \Omega^p(U \times \Sigma)$ is $d_\Sigma$ -exact and hence in particular $d_\Sigma$ -closed for all $\Phi_{(-)}$ :

d_\Sigma j^\infty(\Phi_{(-)})^\ast H \;=\; 0 \,.

By prop. this means that

j^\infty(\Phi_{(-)})^\ast ( d H ) \;= \; 0

for all $\Phi_{(-)}$ . Since $H \in \Omega^{p,0}_\Sigma(E)$ is horizontal, the same proposition (see also example ) implies that in fact $H$ is horizontally closed:

d H \;=\; 0 \,.

Now since the field bundle $E \overset{fb}{\to} \Sigma$ is trivial by assumption, prop. applies and says that this horizontally closed form on the jet bundle is in fact horizontally exact.

In conclusion this shows that the kernel of the transgression map $\tau_{\Sigma_p} \;\colon\; \Omega^{p,0}_\Sigma(E) \to C^\infty\left( \Gamma_{\Sigma_p}(E)\right)$ is precisely the space of horizontally exact horizontal $p$ -forms.

Therefore the claim now follows with the statement that horizontally exact Hamiltonian differential forms constitute a Lie ideal for the local Poisson bracket on the jet bundle; this is lemma .

Example

(Poisson bracket of the real scalar field)

Consider the Lagrangian field theory of the free scalar field (example ), and consider the Cauchy surface defined by $x^0 = 0$ .

By example the local Poisson bracket of the Hamiltonian forms

Q \coloneqq \phi \iota_{\partial_0} dvol_\Sigma \in \Omega^{p,0}(E)

and

P \coloneqq \eta^{\mu \nu} \phi_{,\mu} \iota_{\partial_\nu} dvol_{\Sigma} \in \Omega^{p,0}(E) \,.

\{Q,P\} = \iota_{v_Q} \iota_{v_P} \omega = \iota_{\partial_0} dvol_\Sigma \,.

Upon transgression according to def. this yields the following Poisson bracket

\left\{ \int_{\Sigma_p} b_1(\vec x) \phi(t,\vec x) \iota_{\partial_0} dvol_\Sigma(x) d^p \vec x \;,\; \int_{\Sigma_p} b_2(\vec x) \partial_0 \phi(t,\vec x) \iota_{\partial_0} dvol_\Sigma(\vec x) \right\} \;=\; \int_{\Sigma_p} b_1(\vec x) b_2(\vec x) \iota_{\partial_0} dvol_\Sigma(\vec x) d^p \vec x \,,

where

\mathbf{\Phi}(x), \partial_0 \mathbf{\Phi}(x) \;:\; PhaseSpace(\Sigma_p^t) \to \mathbb{R}

denote the point-evaluation observables (example ), which act on a field history $\Phi \in \Gamma_\Sigma(E) = C^\infty(\Sigma)$ as

\mathbf{\Phi}(x) \;\colon\; \Phi \mapsto \Phi(x) \phantom{AAAAAAAA} \partial_0 \mathbf{\Phi}(x) \;\colon\; \Phi \mapsto \partial_0 \Phi(x) \,.

Notice that these point-evaluation functions themselves do not arise as the transgression of elements in $\Omega^{p,0}(E)$ ; only their smearings such as $\int_{\Sigma_p} b_1 \phi dvol_{\Sigma_p}$ do. Nevertheless we may express the above Poisson bracket conveniently via the integral kernel

(125)

\left\{ \mathbf{\Phi}(t,\vec x), \partial_0\mathbf{\Phi}(t,\vec y) \right\} \;=\; \delta(\vec x - \vec y) \,.

Proposition

(super-Poisson bracket of the Dirac field)

Consider the Lagrangian field theory of the free Dirac field on Minkowski spacetime (example ) with field bundle the odd-shifted spinor bundle $E = \Sigma \times S_{odd}$ (example ) and with

\theta \Psi_\alpha(x) \;\colon\; \mathbb{R}^{0\vert 1} \longrightarrow \left[ \Gamma_\Sigma(\Sigma \times S_{odd})_{\delta_{EL}\mathbf{L} = 0}, \mathbb{C} \right]

the corresponding odd-graded point-evaluation observable (example ).

Then consider the Cauchy surfaces in Minkowski spacetime (def. ) given by $x^0 = t$ for $t \in \mathbb{R}$ . Under transgression to this Cauchy surface via def. , the local Poisson bracket, which by example is given by the super Lie bracket

\left\{ \left( \gamma^\mu \psi \right)_\alpha \, \iota_{\partial_\mu} dvol_\Sigma \,,\, \left(\overline{\psi}\gamma^\mu\right)^\beta\, \iota_{\partial_\mu} dvol_\Sigma \right\} \;=\; \left(\gamma^\mu\right)_\alpha{}^{\beta} \, \iota_{\partial_\mu} dvol_\Sigma \,,

has integral kernel

\left\{ \psi_\alpha(t,\vec x) , \overline{\psi}^\beta(t,\vec y) \right\} \;=\; (\gamma^0)_{\alpha}{}^\beta \delta(\vec y - \vec x) \,.

$\,$

This concludes our discussion of the phase space and the Poisson-Peierls bracket for well behaved Lagrangian field theories. In the next chapter we discuss in detail the integral kernels corresponding to the Poisson-Peierls bracket for key classes of examples. These are the propagators of the theory.

Propagators

In this chapter we discuss the following topics:

Background
Propagators for the free scalar field on Minkowski spacetime
Propagators for the Dirac field on Minkowski spacetime

$\,$

In the previous chapter we have seen the covariant phase space (prop. ) of sufficiently nice Lagrangian field theories, which is the on-shell space of field histories equipped with the presymplectic form transgressed from the presymplectic current of the theory; and we have seen that in good cases this induces a bilinear pairing on sufficiently well-behaved observables, called the Poisson bracket (def. ), which reflects the infinitesimal symmetries of the presymplectic current. This Poisson bracket is of central importance for passing to actual quantum field theory, since, as we will discuss in Quantization below, it is the infinitesimal approximation to the quantization of a Lagrangian field theory.

We have moreover seen that the Poisson bracket on the covariant phase space of a free field theory with Green hyperbolic equations of motion – the Peierls-Poisson bracket – is determined by the integral kernel of the causal Green function (prop. ). Under the identification of linear on-shell observables with off-shell observables that are generalized solutions to the equations of motion (theorem ) the convolution with this integral kernel may be understood as propagating the values of an off-shell observable through spacetime, such as to then compare it with any other observable at any spacetime point (prop. ). Therefore the integral kernel of the causal Green function is also called the causal propagator (prop. ).

This means that for Green hyperbolic free Lagrangian field theory the Poisson bracket, and hence the infinitesimal quantization of the theory, is all encoded in the causal propagator. Therefore here we analyze the causal propagator, as well as its variant propagators, in detail.

The main tool for these computations is Fourier analysis (reviewed below) by which field histories, observables and propagators on Minkowski spacetime are decomposed as superpositions of plane waves of various frequencies, wave lengths and wave vector-direction. Using this, all propagators are exhibited as those superpositions of plane waves which satisfy the dispersion relation of the given equation of motion, relating plane wave frequency to wave length.

This way the causal propagator is naturally decomposed into its contribution from positive and from negative frequencies. The positive frequency part of the causal propagator is called the Wightman propagator (def. below). It turns out (prop. below) that this is equivalently the sum of the causal propagator, which itself is skew-symmetric (cor. below), with a symmetric component, or equivalently that the causal propagator is the skew-symmetrization of the Wightman propagator. After quantization of free field theory discussed further below, we will see that the Wightman propagator is equivalently the correlation function between two point-evaluation field observables (example ) in a vacuum state of the field theory (a state in the sense of def. ).

Moreover, by def. the causal propagator also decomposes into its contributions with future and past support, given by the difference between the advanced and retarded propagators. These we analyze first, starting with prop. below.

Combining these two decompositions of the causal propagator (positive/negative frequency as well as positive/negative time) yields one more propagator, the Feynman propagator (def. below).

We will see below that the quantization of a free field theory is given by a “star product” (on observables) which is given by “exponentiating” these propagators. For that to make sense, certain pointwise products of these propagators, regarded as generalized functions (prop. ) need to exist. But since the propagators are distributions with singularities, the existence of these products requires that certain potential “UV divergences” in their Fourier transforms (remark below) are absent (“Hörmander's criterion”, prop. below). These UV divergences are captured by what what is called the wave front set (def. below).

The study of UV divergences of distributions via their wave front sets is called microlocal analysis and provides powerful tools for the understanding of quantum field theory. In particular the propagation of singularities theorem (prop. ) shows that for distributional solutions (def. ) of Euler-Lagrange equations of motion, such as the propagators, their singular support propagates itself through spacetime along the wave front set.

Using this theorem we work out the wave front sets of the propagators (prop. below). Via Hörmander's criterion (prop. ) this computation will serve to show why upon quantization the Wightman propagator replaces the causal propagator in the construction of the Wick algebra of quantum observables of the free field theory (discussed below in Free quantum fields) and the Feynman propagator similarly controls the quantum observables of the interacting quantum field theory (below in Feynman diagrams).

$\,$

The following table summarizes the structure of the system of propagators. (The column “as vacuum expectation value of field operators” will be discussed further below in Free quantum fields).

$\,$

propagators (i.e. integral kernels of Green functions)
for the wave operator and Klein-Gordon operator
on a globally hyperbolic spacetime such as Minkowski spacetime:

name	symbol	wave front set	as vacuum exp. value of field operators	as a product of field operators
causal propagator	$\begin{aligned}\Delta_S & = \Delta_+ - \Delta_- \end{aligned}$	$\phantom{A}\,\,\,-$	$\begin{aligned} & i \hbar \, \Delta_S(x,y) = \\ & \left\langle \;\left[\mathbf{\Phi}(x),\mathbf{\Phi}(y)\right]\; \right\rangle \end{aligned}$	Peierls-Poisson bracket
advanced propagator	$\Delta_+$		$\begin{aligned} & i \hbar \, \Delta_+(x,y) = \\ & \left\{ \array{ \left\langle \; \left[ \mathbf{\Phi}(x),\mathbf{\Phi}(y) \right] \; \right\rangle &\vert& x \geq y \\ 0 &\vert& y \geq x } \right. \end{aligned}$	future part of Peierls-Poisson bracket
retarded propagator	$\Delta_-$		$\begin{aligned} & i \hbar \, \Delta_-(x,y) = \\ & \left\{ \array{ \left\langle \; \left[\mathbf{\Phi}(x),\mathbf{\Phi}(y) \right] \; \right\rangle &\vert& y \geq x \\ 0 &\vert& x \geq y } \right. \end{aligned}$	past part of Peierls-Poisson bracket
Wightman propagator	$\begin{aligned} \Delta_H &= \tfrac{i}{2}\left( \Delta_+ - \Delta_-\right) + H\\ & = \tfrac{i}{2}\Delta_S + H \\ & = \Delta_F - i \Delta_- \end{aligned}$		$\begin{aligned} & \hbar \, \Delta_H(x,y) \\ & = \left\langle \; \mathbf{\Phi}(x) \mathbf{\Phi}(y) \; \right\rangle \\ & = \underset{ = 0 }{\underbrace{\left\langle \; : \mathbf{\Phi}(x) \mathbf{\Phi}(y) : \; \right\rangle}} \\ & \phantom{=} + \left\langle \; \left[ \mathbf{\Phi}^{(-)}(x), \mathbf{\Phi}^{(+)}(y) \right] \; \right\rangle \end{aligned}$	positive frequency of Peierls-Poisson bracket, Wick algebra-product, 2-point function $\phantom{=}$ of vacuum state $\phantom{=}$ or generally of $\phantom{=}$ Hadamard state
Feynman propagator	$\begin{aligned}\Delta_F & = \tfrac{i}{2}\left( \Delta_+ + \Delta_- \right) + H \\ & = i \Delta_D + H \\ & = \Delta_H + i \Delta_- \end{aligned}$		$\begin{aligned} & \hbar \, \Delta_F(x,y) \\ & = \left\langle \; T\left( \; \mathbf{\Phi}(x)\mathbf{\Phi}(y) \;\right) \; \right\rangle \\ & = \left\{ \array{ \left\langle \; \mathbf{\Phi}(x)\mathbf{\Phi}(x) \; \right\rangle &\vert& x \geq y \\ \left\langle \; \mathbf{\Phi}(y) \mathbf{\Phi}(x) \; \right\rangle &\vert& y \geq x } \right.\end{aligned}$	time-ordered product

(see also Kocic‘s overview: pdf)

$\,$

Fourier analysis and plane wave modes

By definition, the equations of motion of free field theories (def. ) are linear partial differential equations and hence lend themselves to harmonic analysis, where all field histories are decomposed into superpositions of plane waves via Fourier transform. Here we briefly survey the relevant definitions and facts of Fourier analysis.

In formal duality to the harmonic analysis of the field histories themselves, also the linear observables (def. ) on the space of field histories, hence the distributional generalized functions (prop. ) are subject to Fourier transform of distributions (def. below).

Throughout, let $n \in \mathbb{N}$ and consider the Cartesian space $\mathbb{R}^n$ of dimension $n$ (def. ). In the application to field theory, $n = p + 1$ is the dimension of spacetime and $\mathbb{R}^n$ is either Minkowski spacetime $\mathbb{R}^{p,1}$ (def. ) or its dual vector space, thought of as the space of wave vectors (def. below). For $x = (x^\mu) \in \mathbb{R}^{p,1}$ and $k = (k_\mu) \in (\mathbb{R}^(p,1))^\ast$ we write

x \cdot k \;=\; x^\mu k_\mu

for the canonical pairing.

Definition

(plane wave)

A plane wave on Minkowski spacetime $\mathbb{R}^{p,1}$ (def. ) is a smooth function with values in the complex numbers given by

\array{ \mathbb{R}^{p,1} &\longrightarrow& \mathbb{C} \\ (x^\mu) &\mapsto& e^{i k_\mu x^\mu} }

for $k = (k_\mu) \in (\mathbb{R}^{p,1})^\ast$ a covector, called the wave vector of the plane wave.

We use the following terminology:

plane waves on Minkowski spacetime

\array{ \mathbb{R}^{p,1} &\overset{\psi_k}{\longrightarrow}& \mathbb{C} \\ x &\mapsto& \exp\left( \, i k_\mu x^\mu \, \right) \\ (\vec x, x^0) &\mapsto& \exp\left( \, i \vec k \cdot \vec x + i k_0 x^0 \, \right) \\ (\vec x, c t) &\mapsto& \exp\left( \, i \vec k \cdot \vec x - i \omega t \, \right) }

symbol	name
$c$	speed of light
$\hbar$	Planck's constant
$\,$	$\,$
$m$	mass
$\frac{\hbar}{m c}$	Compton wavelength
$\,$	$\,$
$k$ , $\vec k$	wave vector
$\lambda = 2\pi/{\vert \vec k \vert}$	wave length
${\vert \vec k \vert} = 2\pi/\lambda$	wave number
$\omega \coloneqq k^0 c = -k_0 c = 2\pi \nu$	angular frequency
$\nu = \omega / 2 \pi$	frequency
$p = \hbar k$ , $\vec p = \hbar \vec k$	momentum
$E = \hbar \omega$	energy
$\omega(\vec k) = c \sqrt{ \vec k^2 + \left(\frac{m c}{\hbar}\right)^2 }$	Klein-Gordon dispersion relation
$E(\vec p) = \sqrt{ c^2 \vec p^2 + (m c^2)^2 }$	energy-momentum relation

Definition

(Schwartz space of functions with rapidly decreasing partial derivatives)

A complex-valued smooth function $f \in C^\infty(\mathbb{R}^n)$ is said to have rapidly decreasing partial derivatives if for all $\alpha,\beta \in \mathbb{N}^{n}$ we have

\underset{x \in \mathbb{R}^n}{sup} {\vert x^\beta \partial^\alpha f(x) \vert} \;\lt\; \infty \,.

Write

\mathcal{S}(\mathbb{R}^n) \hookrightarrow C^\infty(\mathbb{R}^n)

for the sub-vector space on the functions with rapidly decreasing partial derivatives regarded as a topological vector space for the Fréchet space structure induced by the seminorms

p_{\alpha, \beta}(f) \coloneqq \underset{x \in \mathbb{R}^n}{sup} {\vert x^\beta \partial^\alpha f(x) \vert} \,.

This is also called the Schwartz space.

(e.g. Hörmander 90, def. 7.1.2)

Example

(compactly supported smooth function are functions with rapidly decreasing partial derivatives)

Every compactly supported smooth function (bump function) $b \in C^\infty_{cp}(\mathbb{R}^n)$ has rapidly decreasing partial derivatives (def. ):

C^\infty(\mathbb{R}^n) \hookrightarrow \mathcal{S}(\mathbb{R}^n) \,.

Proposition

(pointwise product and convolution product on Schwartz space)

The Schwartz space $\mathcal{S}(\mathbb{R}^n)$ (def. ) is closed under the following operatios on smooth functions $f,g \in \mathcal{S}(\mathbb{R}^n) \hookrightarrow C^\infty(\mathbb{R}^n)$

pointwise product:

$(f \cdot g)(x) \coloneqq f(x) \cdot g(x)$
convolution product:

$(f \star g)(x) \coloneqq \underset{y \in \mathbb{R}^n}{\int} f(y)\cdot g(x-y) \, dvol(y) \,.$

Proof

By the product law of differentiation.

Proposition

(rapidly decreasing functions are integrable)

Every rapidly decreasing function $f \colon \mathbb{R}^n \to \mathbb{R}$ (def. ) is an integrable function in that its integral exists:

\underset{x \in \mathbb{R}^n}{\int} f(x) \, d^n x \;\lt\; \infty

In fact for each $\alpha \in \mathbb{N}^n$ the product of $f$ with the $\alpha$ -power of the coordinate functions exists:

\underset{x \in \mathbb{R}^n}{\int} x^\alpha f(x)\, d^n x \;\lt\; \infty \,.

Definition

(Fourier transform of functions with rapidly decreasing partial derivatives)

The Fourier transform is the continuous linear functional

\widehat{(-)} \;\colon\; \mathcal{S}(\mathbb{R}^n) \longrightarrow \mathcal{S}(\mathbb{R}^n)

on the Schwartz space of functions with rapidly decreasing partial derivatives (def. ), which is given by integration against plane wave functions (def. )

x \mapsto e^{- i k \cdot x}

times the standard volume form $d^n x$ :

(126)

\hat f(k) \;\colon\; \int_{x \in \mathbb{R}^n} e^{- i \, k \cdot x} f(x) \, d^n x \,.

Here the argument $k \in \mathbb{R}^n$ of the Fourier transform is also called the wave vector.

(e.g. Hörmander, lemma 7.1.3)

Proposition

(Fourier inversion theorem)

The Fourier transform $\widehat{(-)}$ (def. ) on the Schwartz space $\mathcal{S}(\mathbb{R}^n)$ (def. ) is an isomorphism, with inverse function the inverse Fourier transform

\widecheck {(-)} \;\colon\; \mathcal{S}(\mathbb{R}^n) \longrightarrow \mathcal{S}(\mathcal{R}^n)

given by

\widecheck g (x) \;\coloneqq\; \underset{k \in \mathbb{R}^n}{\int} g(k) e^{i k \cdot x} \, \frac{d^n k}{(2\pi)^n} \,.

Hence in the language of harmonic analysis the function $\widecheck g \colon \mathbb{R}^n \to \mathbb{C}$ is the superposition of plane waves (def. ) in which the plane wave with wave vector $k\in \mathbb{R}^n$ appears with amplitude $g(k)$ .

(e.g. Hörmander, theorem 7.1.5)

Proposition

(basic properties of the Fourier transform)

The Fourier transform $\widehat{(-)}$ (def. ) on the Schwartz space $\mathcal{S}(\mathbb{R}^n)$ (def. ) satisfies the following properties, for all $f,g \in \mathcal{S}(\mathbb{R}^n)$ :

(interchanging coordinate multiplication with partial derivatives)

(127) $\widehat{ x^a f } = + i \partial_a \widehat f \phantom{AAAAA} \widehat{ - i\partial_a f} = k_a \widehat f$
(interchanging pointwise multiplication with convolution product, remark ):

(128) $\widehat {(f \star g)} = \widehat{f} \cdot \widehat{g} \phantom{AAAA} \widehat{ f \cdot g } = (2\pi)^{-n} \widehat{f} \star \widehat{g}$
(unitarity, Parseval's theorem)

$\underset{x \in \mathbb{R}^n}{\int} f(x) g^\ast(x)\, d^n x \;=\; \underset{k \in \mathbb{R}^n}{\int} \widehat{f}(k) \widehat{g}^\ast(k) \, d^n k$
(129) $\underset{k \in \mathbb{R}^n}{\int} \widehat{f}(k) \cdot g(k) \, d^n k \;=\; \underset{x \in \mathbb{R}^n}{\int} f(x) \cdot \widehat{g}(x) \, d^n x$

(e.g Hörmander 90, lemma 7.1.3, theorem 7.1.6)

The Schwartz space of functions with rapidly decreasing partial derivatives (def. ) serves the purpose to support the Fourier transform (def. ) together with its inverse (prop. ), but for many applications one needs to apply the Fourier transform to more general functions, and in fact to generalized functions in the sense of distributions (via this prop.). But with the Schwartz space in hand, this generalization is readily obtained by formal duality:

Definition

(tempered distribution)

A tempered distribution is a continuous linear functional

u \;\colon\; \mathcal{S}(\mathbb{R}^n) \longrightarrow \mathbb{C}

on the Schwartz space (def. ) of functions with rapidly decaying partial derivatives. The vector space of all tempered distributions is canonically a topological vector space as the dual space to the Schwartz space, denoted

\mathcal{S}'(\mathbb{R}^n) \;\coloneqq\; \left( \mathcal{S}(\mathbb{R}^n) \right)^\ast \,.

e.g. (Hörmander 90, def. 7.1.7)

Example

(some non-singular tempered distributions)

Every function with rapidly decreasing partial derivatives $f \in \mathcal{S}(\mathbb{R}^n)$ (def. ) induces a tempered distribution $u_f \in \mathcal{S}'(\mathbb{R}^n)$ (def. ) by integrating against it:

u_f \;\colon\; g \mapsto \underset{x \in \mathbb{R}^n}{\int} g(x) f(x)\, d^n x \,.

This construction is a linear inclusion

\mathcal{S}(\mathbb{R}^n) \overset{\text{dense}}{\hookrightarrow} \mathcal{S}'(\mathbb{R}^n)

of the Schwartz space into its dual space of tempered distributions. This is a dense subspace inclusion.

In fact already the restriction of this inclusion to the compactly supported smooth functions (example ) is a dense subspace inclusion:

C^\infty_{cp}(\mathbb{R}^n) \overset{dense}{\hookrightarrow} \mathcal{S}'(\mathbb{R}^n) \,.

This means that every tempered distribution is a limit of a sequence of ordinary functions with rapidly decreasing partial derivatives, and in fact even the limit of a sequence of compactly supported smooth functions (bump functions).

It is in this sense that tempered distributions are “generalized functions”.

(e.g. Hörmander 90, lemma 7.1.8)

Example

(compactly supported distributions are tempered distributions)

Every compactly supported distribution is a tempered distribution (def. ), hence there is a linear inclusion

\mathcal{E}'(\mathbb{R}^n) \hookrightarrow \mathcal{S}'(\mathbb{R}^n) \,.

Example

(delta distribution)

Write

\delta_0(-) \;\in\; \mathcal{E}'(\mathbb{R}^n)

for the distribution given by point evaluation of functions at the origin of $\mathbb{R}^n$ :

\delta_0(-) \;\colon\; f \mapsto f(0) \,.

This is clearly a compactly supported distribution; hence a tempered distribution by example .

We write just “ $\delta(-)$ ” (without the subscript) for the corresponding generalized function (example ), so that

\underset{x \in \mathbb{R}^n}{\int} \delta(x) f(x) \, d^n x \;\coloneqq\; f(0) \,.

Example

(square integrable functions induce tempered distributions)

Let $f \in L^p(\mathbb{R}^n)$ be a function in the $p$ th Lebesgue space, e.g. for $p = 2$ this means that $f$ is a square integrable function. Then the operation of integration against the measure $f dvol$

g \mapsto \underset{x \in \mathbb{R}^n}{\int} g(x) f(x) \, d^n x

is a tempered distribution (def. ).

(e.g. Hörmander 90, below lemma 7.1.8)

Property (129) of the ordinary Fourier transform on functions with rapidly decreasing partial derivatives motivates and justifies the fullowing generalization:

Definition

(Fourier transform of distributions on tempered distributions)

The Fourier transform of distributions of a tempered distribution $u \in \mathcal{S}'(\mathbb{R}^n)$ (def. ) is the tempered distribution $\widehat u$ defined on a smooth function $f \in \mathcal{S}(\mathbb{R}^n)$ in the Schwartz space (def. ) by

\widehat{u}(f) \;\coloneqq\; u\left( \widehat f\right) \,,

where on the right $\widehat f \in \mathcal{S}(\mathbb{R}^n)$ is the Fourier transform of functions from def. .

(e.g. Hörmander 90, def. 1.7.9)

Example

(Fourier transform of distributions indeed generalizes Fourier transform of functions with rapidly decreasing partial derivatives)

Let $u_f \in \mathcal{S}'(\mathbb{R}^n)$ be a non-singular tempered distribution induced, via example , from a function with rapidly decreasing partial derivatives $f \in \mathcal{S}(\mathbb{R}^n)$ .

Then its Fourier transform of distributions (def. ) is the non-singular distribution induced from the Fourier transform of $f$ :

\widehat{u_f} \;=\; u_{\hat f} \,.

Proof

Let $g \in \mathcal{S}(\mathbb{R}^n)$ . Then

\begin{aligned} \widehat{u_f}(g) & \coloneqq u_f\left( \widehat{g}\right) \\ & = \underset{x \in \mathbb{R}^n}{\int} f(x) \hat g(x)\, d^n x \\ & = \underset{x \in \mathbb{R}^n}{\int} \hat f(x) g(x) \, d^n x \\ & = u_{\hat f}(g) \end{aligned}

Here all equalities hold by definition, except for the third: this is property (129) from prop. .

Example

(Fourier transform of Klein-Gordon equation of distributions)

Let $\Delta \in \mathcal{S}'(\mathbb{R}^{p,1})$ be any tempered distribution (def. ) on Minkowski spacetime (def. ) and let $P \coloneqq \eta^{\mu \nu} \frac{\partial}{\partial x^\mu}\frac{\partial}{\partial x^\nu} - \left( \tfrac{m c}{\hbar} \right)^2$ be the Klein-Gordon operator (70). Then the Fourier transform (def. ) of $P \Delta$ is, in generalized function-notation (remark )given by

\widehat {P \Delta}(k) \;=\; \left( - \eta^{\mu \nu}k_\mu k_\nu - \left( \tfrac{m c}{\hbar}\right)^2 \right) \widehat(k) \,.

Proof

Let $r \in \mathcal{S}(\mathbb{R}^n)$ be any function with rapidly decreasing partial derivatives (def. ). Then

\begin{aligned} \widehat {P \Delta}(r) & = P \Delta(\widehat r) \\ & = \Delta(P^\ast \widehat r) \\ & = \Delta(P \widehat r) \\ & = \Delta\left( \left(-\eta^{\mu \nu}k_\mu k_\nu - \left( \tfrac{m c}{\hbar}\right)^2\right) \widehat{r} \right) \end{aligned}

Here the first step is def. , the second is def. , the third is example , while the last step is prop. .

Example

(Fourier transform of compactly supported distributions)

Under the identification of smooth functions of bounded growth with non-singular tempered distributions (example ), the Fourier transform of distributions (def. ) of a tempered distribution that happens to be compactly supported (example )

u \in \mathcal{E}'(\mathbb{R}^n) \hookrightarrow \mathcal{S}'(\mathbb{R}^n)

is simply

\widehat{u}(k) = u\left( e^{- i k \cdot (-)}\right) \,.

(Hörmander 90, theorem 7.1.14)

Example

(Fourier transform of the delta-distribution)

The Fourier transform (def. ) of the delta distribution (def. ), via example , is the constant function on 1:

\begin{aligned} \widehat {\delta}(k) & = \underset{x \in \mathbb{R}^n}{\int} \delta(x) e^{- i k x} \, d x \\ & = 1 \end{aligned}

This implies by the Fourier inversion theorem (prop. ) that the delta distribution itself has equivalently the following expression as a generalized function

\begin{aligned} \delta(x) & = \widecheck{\widehat {\delta_0}}(x) \\ & = \underset{k \in \mathbb{R}^n}{\int} e^{i k \cdot x} \, \frac{d^n k}{ (2\pi)^n } \end{aligned}

in the sense that for every function with rapidly decreasing partial derivatives $f \in \mathcal{S}(\mathbb{R}^n)$ (def. ) we have

\begin{aligned} f(x) & = \underset{y \in \mathbb{R}^n}{\int} f(y) \delta(y-x) \, d^n y \\ & = \underset{y \in \mathbb{R}^n}{\int} \underset{k \in \mathbb{R}^n}{\int} f(y) e^{i k \cdot (y-x)} \, \frac{d^n k}{(2\pi)^n} \, d^n y \\ & = \underset{k \in \mathbb{R}^n}{\int} e^{- i k \cdot x} \underset{= \widehat{f}(-k) }{ \underbrace{ \underset{y \in \mathbb{R}^n}{\int} f(y) e^{i k \cdot y} \, d^n y } } \,\, \frac{d^n k}{(2\pi)^n} \\ & = + \underset{k \in \mathbb{R}^n}{\int} e^{i k \cdot x} \underset{= \widehat{f}(k) }{ \underbrace{ \underset{y \in \mathbb{R}^n}{\int} f(y) e^{- i k \cdot y} \, d^n y } } \,\, \frac{d^n k}{(2\pi)^n} \\ & = \widecheck{\widehat{f}}(x) \end{aligned}

which is the statement of the Fourier inversion theorem for smooth functions (prop. ).

(Here in the last step we used change of integration variables $k \mapsto -k$ which introduces one sign $(-1)^{n}$ for the new volume form, but another sign $(-1)^n$ from the re-orientation of the integration domain. )

Equivalently, the above computation shows that the delta distribution is the neutral element for the convolution product of distributions.

Proposition

(Paley-Wiener-Schwartz theorem I)

Let $u \in \mathcal{E}'(\mathbb{R}^n) \hookrightarrow \mathcal{S}'(\mathbb{R}^n)$ be a compactly supported distribution regarded as a tempered distribution by example . Then its Fourier transform of distributions (def. ) is a non-singular distribution induced from a smooth function that grows at most exponentially.

(e.g. Hoermander 90, theorem 7.3.1)

Proposition

(Fourier inversion theorem for Fourier transform of distributions)

The operation of forming the Fourier transform of distributions $\widehat{u}$ (def. ) tempered distributions $u \in \mathcal{S}'(\mathbb{R}^n)$ (def. ) is an isomorphism, with inverse given by

\widecheck{ u } \;\colon\; g \mapsto u\left( \widecheck{g}\right) \,,

where on the right $\widecheck{g}$ is the ordinary inverse Fourier transform of $g$ according to prop. .

Proof

By def. this follows immediately from the Fourier inversion theorem for smooth functions (prop. ).

We have the following distributional generalization of the basic property (128) from prop. :

Proposition

(Fourier transform of distributions interchanges convolution of distributions with pointwise product)

Let

u_1 \in \mathcal{S}'(\mathbb{R}^n)

be a tempered distribution (def. ) and

u_2 \in \mathcal{E}'(\mathbb{R}^n) \hookrightarrow \mathcal{S}'(\mathbb{R}^n)

be a compactly supported distribution, regarded as a tempered distribution via example .

Observe here that the Paley-Wiener-Schwartz theorem (prop. ) implies that the Fourier transform of distributions of $u_1$ is a non-singular distribution $\widehat{u_1} \in C^\infty(\mathbb{R}^n)$ so that the product $\widehat{u_1} \cdot \widehat{u_2}$ is always defined.

Then the Fourier transform of distributions of the convolution product of distributions is the product of the Fourier transform of distributions:

\widehat{u_1 \star u_2} \;=\; \widehat{u_1} \cdot \widehat{u_2} \,.

(e.g. Hörmander 90, theorem 7.1.15)

Remark

(product of distributions via Fourier transform of distributions)

Prop. together with the Fourier inversion theorem (prop. ) suggests to define the product of distributions $u_1 \cdot u_2$ for compactly supported distributions $u_1, u_2 \in \mathcal{E}'(\mathbb{R}^n) \hookrightarrow \mathcal{S}'(\mathbb{R}^n)$ by the formula

\widehat{ u_1 \cdot u_2 } \;\coloneqq\; (2\pi)^n \widehat{u_1} \star \widehat{u_2}

which would complete the generalization of of property (128) from prop. .

For this to make sense, the convolution product of the smooth functions on the right needs to exist, which is not guaranteed (prop. does not apply here!). The condition that this exists is the Hörmander criterion on the wave front set (def. ) of $u_1$ and $u_2$ (prop. belwo). This we further discuss in Microlocal analysis and UV-Divergences below.

$\,$

microlocal analysis and ultraviolet divergences

A distribution (def. ) or generalized function (prop. ) is like a smooth function which may have “singularities”, namely points at which it values or that of its derivatives “become infinite”. Conversely, smooth functions are the non-singular distributions (prop. ). The collection of points around which a distribution is singular (i.e. not non-singular) is called its singular support (def. below).

The Fourier transform of distributions (def. ) decomposes a generalized function into the plane wave modes that it is made of (def. ). The Paley-Wiener-Schwartz theorem (prop. below) says that the singular nature of a compactly supported distribution may be read off from this Fourier mode decomposition: Singularities correspond to large contributions by Fourier modes of high frequency and small wavelength, hence to large “ultraviolet” (UV) contributions (remark below). Therefore the singular support of a distribution is the set of points around which the Fourier transform does not sufficiently decay “in the UV”.

But since the Fourier transform is a function of the full wave vector of the plane wave modes (def. ), not just of the frequency/wavelength, but also of the direction of the wave vector, this means that it contains directional information about the singularities: A distribution may have UV-singularities at some point and in some wave vector direction, but maybe not in other directions.

In particular, if the distribution in question is a distributional solution to a partial differential equation (def. ) on spacetime then the propagation of singularities theorem (prop. below) says that the singular support of the solution evolves in spacetime along the direction of those wave vectors in which the Fourier transform exhibits high UV constributions. This means that these directions are the “wave front” of the distributional solution. Accordingly, the singular support of a distribution together with, over each of its points, the directions of wave vectors in which the Fourier transform around that point has large UV constributions is called the wave front set of the distribution (def. below).

What is called microlocal analysis is essentially the analysis of distributions with attention to their wave front set, hence to the wave vector-directions of UV divergences.

In particular the product of distributions is well defined (only) if the wave front sets of the distributions to not “collide”. And this in fact motivates the definition of the wave front set:

To see this, let $u,v \in \mathcal{D}'(\mathbb{R}^1)$ be two distributions, for simplicity of exposition taken on the real line.

Since the product $u \cdot v$ , is, if it exists, supposed to generalize the pointwise product of smooth functions, it must be fixed locally: for every point $x \in \mathbb{R}$ there ought to be a compactly supported smooth function (bump function) $b \in C^\infty_{cp}(\mathbb{R})$ with $f(x) = 1$ such that

b^2 u \cdot v = (b u) \cdot (b v) \,.

But now $b v$ and $b u$ are both compactly supported distributions (def. below), and these have the special property that their Fourier transforms $\widehat{b v}$ and $\widehat{b u}$ are, in particular, smooth functions (by the Paley-Wiener-Schwartz theorem, prop ).

Moreover, the operation of Fourier transform interchanges pointwise products with convolution products (prop. ). This means that if the product of distributions $u \cdot v$ exists, it must locally be given by the inverse Fourier transform of the convolution product of the Fourier transforms $\widehat {b u}$ and $\widehat b v$ :

\widehat{ b^2 u \cdot v }(x) \;=\; \underset{\underset{k_{max} \to \infty}{\longrightarrow}}{\lim} \, \int_{- k_{max}}^{k_{max}} \widehat{(b u)}(k) \widehat{(b v)}(x - k) d k \,.

(Notice that the converse of this formula holds as a fact by prop. )

This shows that the product of distributions exists once there is a bump function $b$ such that the integral on the right converges as $k_{max} \to \infty$ .

Now the Paley-Wiener-Schwartz theorem says more, it says that the Fourier transforms $\widehat {b u}$ and $\widehat {b u}$ are polynomially bounded. On the other hand, the integral above is well defined if the integrand decreases at least quadratically with $k \to \infty$ . This means that for the convolution product to be well defined, either $\widehat {b u}$ has to polynomially decrease faster with $k \to \pm \infty$ than $\widehat {b v}$ grows in the other direction, $k \to \mp \infty$ (due to the minus sign in the argument of the second factor in the convolution product), or the other way around.

Moreover, the degree of polynomial growth of the Fourier transform increases by one with each derivative (def. ). Therefore if the product law for derivatives of distributions is to hold generally, we need that either $\widehat{b u}$ or $\widehat{b v}$ decays faster than any polynomial in the opposite of the directions in which the respective other factor does not decay.

Here the set of directions of wave vectors in which the Fourier transform of a distribution localized around any point does not decay exponentially is the wave front set of a distribution (def. below). Hence the condition that the product of two distributions is well defined is that for each wave vector direction in the wave front set of one of the two distributions, the opposite direction must not be an element of the wave front set of the other distribution. This is called Hörmander's criterion (prop. below).

We now say this in detail:

Definition

(restriction of distributions)

For $U \subset \mathbb{R}^n$ a subset, and $u \in \mathcal{D}'(\mathbb{R}^n)$ a distribution, then the restriction of $u$ to $U$ is the distribution

u\vert_U \in \mathcal{D}'(U)

give by restricting $u$ to test functions whose support is in $U$ .

Definition

(singular support of a distribution)

Given a distribution $u \in \mathcal{D}'(\mathbb{R}^n)$ , a point $x \in \mathbb{R}^n$ is a singular point if there is no neighbourhood $U \subset \mathbb{R}^n$ of $x$ such that the restriction $u\vert_U$ (def. ) is a non-singular distribution (given by a smooth function).

The set of all singular points is the singular support $supp_{sing}(u) \subset \mathbb{R}^n$ of $u$ .

Definition

(product of a distribution with a smooth function)

Let $u \in \mathcal{D}'(\mathbb{R}^n)$ be a distribution, and $f \in C^\infty(\mathbb{R}^n)$ a smooth function. Then the product $f u \in \mathcal{D}'(\mathbb{R}^n)$ is the evident distribution given on a test function $b \in C^\infty_{cp}(\mathbb{R}^n)$ by

f u \;\colon\; u \mapsto u(f \cdot b) \,

Proposition

(Paley-Wiener-Schwartz theorem II – decay of Fourier transform of compactly supported functions)

A compactly supported distribution $u \in \mathcal{E}'(\mathbb{R}^n)$ is non-singular, hence given by a compactly supported function $b \in C^\infty_{cp}(\mathbb{R}^n)$ via $u(f) = \int b(x) f(x) dvol(x)$ , precisely if its Fourier transform $\hat u$ (this def.) satisfies the following decay property:

For all $N \in \mathbb{N}$ there exists $C_N \in \mathbb{R}_+$ such that for all $k \in \mathbb{R}^n$ we have that the absolute value ${\vert \hat v(k)\vert}$ of the Fourier transform at that point is bounded by

(130)

{\vert \hat v(k)\vert} \;\leq\; C_N \left( 1 + {\vert k\vert} \right)^{-N} \,.

(Hörmander 90, around (8.1.1))

Remark

(ultraviolet divergences)

In words, the Paley-Wiener-Schwartz theorem II (prop. ) says that the singularities of a distribution “in position space” are reflected in non-decaying contributions of high frequencies (small wavelength) in its Fourier mode-decomposition (def. ). Since for ordinary light waves one associates high frequency with the “ultraviolet”, we may think of these as “ultaviolet contributions”.

But apart from the wavelength, the wave vector that the Fourier transform of distributions depends on also encodes the direction of the corresponding plane wave. Therefore the Paley-Wiener-Schwartz theorem says in more detail that a distribution is singular at some point already if along any one direction of the wave vector its local Fourier transform picks up ultraviolet contributions in that direction.

It therefore makes sense to record this extra directional information in the singularity structure of a distribution. This is called the wave front set (def. ) below. The refined study of singularities taking this directional information into account is what is called microlocal analysis.

Moreover, the Paley-Wiener-Schwartz theorem I (prop. ) says that if the ultraviolet contributions diverge more than polynomially with high frequency, then the corresponding would-be compactly supported distribution is not only singular, but is actually ill defined.

Such ultraviolet divergences appear notably when forming a would-be product of distributions whose two factors have wave front sets whose UV-contributions “add up”. This condition for the appearance/avoidance of UV-divergences is called Hörmander's criterion (prop. below).

Definition

(wavefront set)

Let $u \in \mathcal{D}'(\mathbb{R}^n)$ be a distribution. For $b \in C^\infty_{cp}(\mathbb{R}^n)$ a compactly supported smooth function, write $b u \in \mathcal{E}'(\mathbb{R}^n)$ for the corresponding product (def. ), which is now a compactly supported distribution.

For $x\in supp(b) \subset \mathbb{R}^n$ , we say that a unit covector $k \in S((\mathbb{R}^n)^\ast)$ is regular if there exists a neighbourhood $U \subset S((\mathbb{R}^n)^\ast)$ of $k$ in the unit sphere such that for all $c k' \in (\mathbb{R}^n)^\ast$ with $c \in \mathbb{R}_+$ and $k' \in U \subset S((\mathbb{R}^n)^\ast)$ the decay estimate (130) is valid for the Fourier transform $\widehat{b u}$ of $b u$ ; at $c k'$ . Otherwise $k$ is non-regular. Write

\Sigma(b u) \;\coloneqq\; \left\{ k \in S((\mathbb{R}^n)^\ast) \;\vert\; k \, \text{non-regular} \right\}

for the set of non-regular covectors of $b u$ .

The wave front set at $x$ is the intersection of these sets as $b$ ranges over bump functions whose support includes $x$ :

\Sigma_x(u) \;\coloneqq\; \underset{ { b \in C^\infty_{cp}(\mathbb{R}^n) } \atop { x \in supp(b) } }{\cap} \Sigma(b u) \,.

Finally the wave front set of $u$ is the subset of the sphere bundle $S(T^\ast \mathbb{R}^n)$ which over $x \in \mathbb{R}^n$ consists of $\Sigma_x(U) \subset T^\ast_x \mathbb{R}^n$ :

WF(u) \;\coloneqq\; \underset{x \in \mathbb{R}^n}{\cup} \Sigma_x(u) \;\subset\; S(T^\ast \mathbb{R}^n)

Often this is equivalently considered as the full conical set inside the cotangent bundle generated by the unit covectors under multiplication with positive real numbers.

(Hörmander 90, def. 8.1.2)

Remark

(wave front set is the UV divergence-direction-bundle over the singular support)

For $u \in \mathcal{D}'(\mathbb{R}^n)$ The Paley-Wiener-Schwartz theorem (prop. ) implies that

Forgetting the direction covectors in the wave front set $WF(u)$ (def. ) and remembering only the points where they are based yields the set of singlar points of $u$ , hence the singular support (def. )

$\array{ WF(u) \\ \downarrow \\ supp_{sing}(u) &\hookrightarrow& \mathbb{R}^n }$
the wave front set is empty, precisely if the singular support is empty, which is the case precisely if $u$ is a non-singular distribution.

Example

(wave front set of non-singular distribution is empty)

By prop. , the wave front set (def. ) of a non-singular distribution (prop. ) is empty. Conversely, a distribution is non-singular if its wave front set is empty:

u \in \mathcal{D}'\;\text{non-singular} \phantom{AA} \Leftrightarrow \phantom{AA} WF(u) = \emptyset

Example

(wave front set of delta distribution)

Consider the delta distribution

\delta_0 \in \mathcal{D}'(\mathbb{R}^n)

given by evaluation at the origin. Its wave front set (def. ) consists of all the directions at the origin:

WF(\delta_0) \;=\; \left\{ (0,k) \;\vert\; k \in \mathbb{R}^n \setminus \{0\} \right\} \subset \mathbb{R}^n \times \mathbb{R}^n \simeq T^\ast \mathbb{R}^n \,.

Proof

First of all the singular support (def. ) of $\delta_0$ is clearly $supp_{sing}(\delta(0)) = \{0\}$ , hence by remark the wave front set vanishes over $\mathbb{R}^n \setminus \{0\}$ .

At the origin, any bump function $b$ supported around the origin with $b(0) = 1$ satisfies $b \cdot \delta(0) = \delta(0)$ and hence the wave front set over the origin is the set of covectors along which the Fourier transform $\hat \delta(0)$ does not suitably decay. But this Fourier transform is in fact a constant function (example ) and hence does not decay in any direction.

Example

(wave front set of step function)

Let $\Theta \in \mathcal{D}'(\mathbb{R}^1)$ be the Heaviside step function given by

\Theta(b) \coloneqq \int_0^\infty b(x)\, d x \,.

Its wave front set (def. ) is

WF(H) = \{(0,k) \vert k \neq 0\} \,.

Proposition

(wave front set of convolution of compactly supported distributions)

Let $u,v \in \mathcal{E}'(\mathbb{R}^n)$ be two compactly supported distributions. Then the wave front set (def. ) of their convolution of distributions (def. ) is

WF(u \star v) \;=\; \left\{ (x + y, k) \;\vert\; (x,k) \in WF(u) \,\text{and}\, (y,k) \in WF(u) \right\} \,.

(Bengel 77, prop. 3.1)

Proposition

(Hörmander's criterion for product of distributions)

Let $u, v \in \mathcal{D}'(\mathbb{R}^n)$ be two distributions. If their wave front sets (def ) do not collide, in that for $v \in T^\ast_x X$ a covector contained in one of the two wave front sets then the covector $-v \in T^\ast_x X$ with the opposite direction in not contained in the other wave front set, i.e. the intersection fiber product inside the cotangent bundle $T^\ast X$ of the pointwise sum of wave fronts with the zero section is empty:

\left( WF(u_1) + WF(u_2) \right) \underset{T^\ast X}{\times} X \;=\; \emptyset

i.e.

\array{ && \emptyset \\ & \swarrow && \searrow \\ WF(u_1) + WF(u_2) && (pb) && X \\ & \searrow && \swarrow_{\mathrlap{0}} \\ && T^\ast X }

then the product of distributions $u \cdot v$ exists, given, locally, by the Fourier inversion of the convolution product of their Fourier transform of distributions (remark ).

For making use of wave front sets, we need a collection of results about how wave front sets change as we apply certain operations to distributions:

Proposition

(differential operator preserves or shrinks wave front set)

Let $P$ be a differential operator (def. ). Then for $u \in \mathcal{D}'$ a distribution, the wave front set (def. ) of the derivative of distributions $P u$ (def. ) is contained in the original wave front set of $u$ :

WF(P u) \subset WF(u)

(Hörmander 90, (8.1.11))

Proposition

(wave front set of product of distributions is inside fiber-wise sum of wave front sets)

Let $u,v \in \mathcal{D}'(X)$ be a pair of distributions satisfying Hörmander's criterion, so that their product of distributions $u \cdot v$ exists by prop. . Then the wave front set (def. ) of the product distribution is contained inside the fiber-wise sum of the wave front set elements of the two factors:

WF(u \cdot v) \;\subset\; (WF(u) \cup (X \times \{0\})) + (WF(v) \cup (X \times \{0\})) \,.

(Hörmander 90, theorem 8.2.10)

More generally:

Proposition

(partial product of distributions of several variables)

Let

K_1 \in \mathcal{D}'(X \times Y) \phantom{AAA} K_2 \in \mathcal{D}'(Y \times Z)

be two distributions of two variables. For their product of distributions to be defined over $Y$ , Hörmander's criterion on the pair of wave front sets $WF(K_1), WF(K_2)$ needs to hold for the wave front wave vectors along $X$ and $Y$ taken to be zero.

If this is satisfied, then composition of integral kernels (if it exists)

(K_1 \circ K_2)(-,-) \;\coloneqq\; \underset{Y}{\int} K_1(-,y) K_2(y,-) dvol_Y(y) \;\in\; \mathcal{D}'(X \times Z)

has wave front set constrained by

(131)

WF(K_1 \circ K_2) \;\subset\; \left\{ (x,z, k_x, k_z) \;\vert\; \array{ \left( (x,y,k_x,-k_y) \in WF(K_1) \,\, \text{and} \,\, (y,z,k_y, k_z) \in WF(K_2) \right) \\ \text{or} \\ \left( k_x = 0 \,\text{and}\, (y,z,0,-k_z) \in WF(K_2) \right) \\ \text{or} \\ \left( k_z = 0 \,\text{and}\, (x,y,k_x,0) \in WF(K_1) \right) } \right\}

(Hörmander 90, theorem 8.2.14)

A key fact for identifying wave front sets is the propagation of singularities theorem (prop. below). In order to state this we need the following concepts regarding symbols of differential operators:

Definition

(symbol of a differential operator)

Let

D \;=\; \underset{n \leq N}{\sum} D^{\mu_1 \cdots \mu_n} \frac{\partial}{\partial x^{\mu^1}} \cdots \frac{\partial}{\partial x^{\mu^n}} + D^0

be a differential operator on $\mathbb{R}^n$ (def. ). Then its symbol of a differential operator is the smooth function on the cotangent bundle $T^\ast \mathbb{R}^n \simeq \mathbb{R}^n \times \mathbb{R}^n$ (def. ) given by

\array{ T^\ast \mathbb{R}^n &\overset{q}{\longrightarrow}& \mathbb{C} \\ k &\mapsto& \underset{n \leq N}{\sum} D^{\mu_1 \cdots \mu_k} k_{\mu_1} \cdots k_{\mu_n} } \,.

The principal symbol is the top degree homogeneous part $D^{\mu_1 \cdots \mu_k} k_{\mu_1} \cdots k_{\mu_N}$ .

Definition

(symbol order)

A smooth function $q$ on the cotangent bundle $T^\ast \mathbb{R}^n$ (e.g. the symbol of a differential operator, def. ) is of order $m$ (and type $1,0$ , denoted $q \in S^m = S^m_{1,0}$ ), for $m \in \mathbb{N}$ , if on each coordinate chart $((x^i), (k_i))$ we have that for every compact subset $K$ of the base space and all multi-indices $\alpha$ and $\beta$ , there is a real number $C_{\alpha, \beta,K } \in \mathbb{R}$ such that the absolute value of the partial derivatives of $q$ is bounded by

\left\vert \frac{\partial^\alpha}{\partial k_\alpha} \frac{\partial^\beta}{\partial x^\beta} q(x,k) \right\vert \;\leq\; C_{\alpha,\beta,K}\left( 1+ {\vert k\vert}\right)^{m - {\vert \alpha\vert}}

for all $x \in K$ and all cotangent vectors $k$ to $x$ .

A Fourier integral operator $Q$ is of symbol class $L^m = L^m_{1,0}$ if it is of the form

Q f (x) \;=\; \int \int e^{i k \cdot (x - y)} q(x,y,k) f(y) \, d y \, d k

with symbol $q$ of order $m$ , in the above sense.

(Hörmander 71, def. 1.1.1 and first sentence of section 2.1 with (1.4.1))

Proposition

(propagation of singularities theorem)

Let $Q$ be a differential operator (def. ) of symbol class $L^m$ (def. ) with real principal symbol $q$ that is homogeneous of degree $m$ .

For $u \in \mathcal{D}'(X)$ a distribution with $Q u = f$ , then the complement of the wave front set of $u$ by that of $f$ is contained in the set of covectors on which the principal symbol $q$ vanishes:

WF(u) \setminus WF(f) \;\subset\; q^{-1}(0) \,.

Moreover, $WF(u)$ is invariant under the bicharacteristic flow induced by the Hamiltonian vector field of $q$ with respect to the canonical symplectic manifold structure on the cotangent bundle (here).

(Duistermaat-Hörmander 72, theorem 6.1.1, recalled for instance as Radzikowski 96, theorem 4.6)

$\,$

Cauchy principal value

An important application of the Fourier analysis of distributions is the class of distributions known broadly as Cauchy principal values. Below we will find that these control the detailed nature of the various propagators of free field theories, notably the Feynman propagator is manifestly a Cauchy principal value (prop. and def. below), but also the singular support properties of the causal propagator and the Wightman propagator are governed by Cauchy principal values (prop. and prop. below). This way the understanding of Cauchy principal values eventually allows us to determine the wave front set of all the propagators (prop. ) below.

Therefore we now collect some basic definitions and facts on Cauchy principal values.

The Cauchy principal value of a function which is integrable on the complement of one point is, if it exists, the limit of the integrals of the function over subsets in the complement of this point as these integration domains tend to that point symmetrically from all sides.

One also subsumes the case that the “point” is “at infinity”, hence that the function is integrable over every bounded domain. In this case the Cauchy principal value is the limit, if it exists, of the integrals of the function over bounded domains, as their bounds tend symmetrically to infinity.

The operation of sending a compactly supported smooth function (bump function) to Cauchy principal value of its pointwise product with a function $f$ that may be singular at the origin defines a distribution, usually denoted $PV(f)$ .

Definition

(Cauchy principal value of an integral over the real line)

Let $f \colon \mathbb{R} \to \mathbb{R}$ be a function on the real line such that for every positive real number $\epsilon$ its restriction to $\mathbb{R}\setminus (-\epsilon, \epsilon)$ is integrable. Then the Cauchy principal value of $f$ is, if it exists, the limit

PV(f) \coloneqq \underset{\epsilon \to 0}{\lim} \underset{\mathbb{R} \setminus (-\epsilon, \epsilon)}{\int} f(x) \, d x \,.

Definition

(Cauchy principal value as distribution on the real line)

Let $f \colon \mathbb{R} \to \mathbb{R}$ be a function on the real line such that for all bump functions $b \in C^\infty_{cp}(\mathbb{R})$ the Cauchy principal value of the pointwise product function $f b$ exists, in the sense of def. . Then this assignment

PV(f) \;\colon\; b \mapsto PV(f b)

defines a distribution $PV(f) \in \mathcal{D}'(\mathbb{R})$ .

Example

Let $f \colon \mathbb{R} \to \mathbb{R}$ be an integrable function which is symmetric, in that $f(-x) = f(x)$ for all $x \in \mathbb{R}$ . Then the principal value integral (def. ) of $x \mapsto \frac{f(x)}{x}$ exists and is zero:

\underset{\epsilon \to 0}{\lim} \underset{\mathbb{R}\setminus (-\epsilon, \epsilon)}{\int} \frac{f(x)}{x} d x \; = \; 0

This is because, by the symmetry of $f$ and the skew-symmetry of $x \mapsto 1/x$ , the the two contributions to the integral are equal up to a sign:

\int_{-\infty}^{-\epsilon} \frac{f(x)}{x} d x \;=\; - \int_{\epsilon}^\infty \frac{f(x)}{x} d x \,.

Example

The Cauchy principal value distribution $PV\left( \frac{1}{x}\right)$ (def. ) solves the distributional equation

(132)

x PV\left(\frac{1}{x}\right) = 1 \phantom{AAA} \in \mathcal{D}'(\mathbb{R}^1) \,.

Since the delta distribution $\delta \in \mathcal{D}'(\mathbb{R}^1)$ solves the equation

x \delta(x) = 0 \phantom{AAA} \in \mathcal{D}'(\mathbb{r}^1)

we have that more generally every linear combination of the form

(133)

F(x) \coloneqq PV(1/x) + c \delta(x) \phantom{AAA} \in \mathcal{D}'(\mathbb{R}^1)

for $c \in \mathbb{C}$ , is a distributional solution to $x F(x) = 1$ .

The wave front set of all these solutions is

WF\left( PV(1/x) + c \delta(x) \right) \;=\; \left\{ (0,k) \;\vert\; k \in \mathbb{R}^\ast \setminus \{0\} \right\} \,.

Proof

The first statement is immediate from the definition: For $b \in C^\infty_c(\mathbb{R}^1)$ any bump function we have that

\begin{aligned} \left\langle x PV\left(\frac{1}{x}\right), b \right\rangle & \coloneqq \underset{\epsilon \to 0}{\lim} \underset{\mathbb{R}^1 \setminus (-\epsilon, \epsilon)}{\int} \frac{x}{x}b(x) \, d x \\ & = \int b(x) d x \\ & = \langle 1,b\rangle \end{aligned}

Regarding the second statement: It is clear that the wave front set is concentrated at the origin. By symmetry of the distribution around the origin, it must contain both directions.

Proposition

In fact (133) is the most general distributional solution to (132).

This follows by the characterization of extension of distributions to a point, see there at this prop. (Hörmander 90, thm. 3.2.4)

Definition

(integration against inverse variable with imaginary offset)

Write

\tfrac{1}{x + i0^\pm} \;\in\; \mathcal{D}'(\mathbb{R})

for the distribution which is the limit in $\mathcal{D}'(\mathbb{R})$ of the non-singular distributions which are given by the smooth functions $x \mapsto \tfrac{1}{x \pm i \epsilon}$ as the positive real number $\epsilon$ tends to zero:

\frac{1}{ x + i 0^\pm } \;\coloneqq\; \underset{ { \epsilon \in (0,\infty) } \atop { \epsilon \to 0 } }{\lim} \tfrac{1}{x \pm i \epsilon}

hence the distribution which sends $b \in C^\infty(\mathbb{R}^1)$ to

b \mapsto \underset{\mathbb{R}}{\int} \frac{b(x)}{x \pm i \epsilon} \, d x \,.

Proposition

(Cauchy principal value equals integration with imaginary offset plus delta distribution)

The Cauchy principal value distribution $PV\left( \tfrac{1}{x}\right) \in \mathcal{D}'(\mathbb{R})$ (def. ) is equal to the sum of the integration over $1/x$ with imaginary offset (def. ) and a delta distribution.

PV\left(\frac{1}{x}\right) \;=\; \frac{1}{x + i 0^\pm} \pm i \pi \delta \,.

In particular, by prop. this means that $\tfrac{1}{x + i 0^\pm}$ solves the distributional equation

x \frac{1}{x + i 0^\pm} \;=\; 1 \phantom{AA} \in \mathcal{D}'(\mathbb{R}^1) \,.

Proof

Using that

\begin{aligned} \frac{1}{x \pm i \epsilon} & = \frac{ x \mp i \epsilon }{ (x + i \epsilon)(x - i \epsilon) } \\ & = \frac{ x \mp i \epsilon }{(x^2 + \epsilon^2)} \end{aligned}

we have for every bump function $b \in C^\infty_{cp}(\mathbb{R}^1)$

\begin{aligned} \underset{\epsilon \to 0}{\lim} \underset{\mathbb{R}^1}{\int} \frac{b(x)}{x \pm i \epsilon} d x & \;=\; \underset{ (A) }{ \underbrace{ \underset{\epsilon \to 0}{\lim} \underset{\mathbb{R}^1}{\int} \frac{x^2}{x^2 + \epsilon^2} \frac{b(x)}{x} d x } } \mp i \pi \underset{(B)}{ \underbrace{ \underset{\epsilon \to 0}{\lim} \underset{\mathbb{R}^1}{\int} \frac{1}{\pi} \frac{\epsilon}{x^2 + \epsilon^2} b(x) \, d x }} \end{aligned}

Since

\array{ && \frac{x^2}{x^2 + \epsilon^2} \\ & {}^{\mathllap{ { {\vert x \vert} \lt \epsilon } \atop { \epsilon \to 0 } }}\swarrow && \searrow^{\mathrlap{ {{\vert x\vert} \gt \epsilon} \atop { \epsilon \to 0 } }} \\ 0 && && 1 }

it is plausible that $(A) = PV\left( \frac{b(x)}{x} \right)$ , and similarly that $(B) = b(0)$ . In detail:

\begin{aligned} (A) & = \underset{\epsilon \to 0}{\lim} \underset{\mathbb{R}^1}{\int} \frac{x}{x^2 + \epsilon^2} b(x) d x \\ & = \underset{\epsilon \to 0}{\lim} \underset{\mathbb{R}^1}{\int} \frac{d}{d x} \left( \tfrac{1}{2} \ln(x^2 + \epsilon^2) \right) b(x) d x \\ & = -\tfrac{1}{2} \underset{\epsilon \to 0}{\lim} \underset{\mathbb{R}^1}{\int} \ln(x^2 + \epsilon^2) \frac{d b}{d x}(x) d x \\ & = -\tfrac{1}{2} \underset{\mathbb{R}^1}{\int} \ln(x^2) \frac{d b}{d x}(x) d x \\ & = - \underset{\mathbb{R}^1}{\int} \ln({\vert x \vert}) \frac{d b}{d x}(x) d x \\ & = - \underset{\epsilon \to 0}{\lim} \underset{\mathbb{R}^1\setminus (-\epsilon, \epsilon)}{\int} \ln( {\vert x \vert} ) \frac{d b}{d x}(x) d x \\ & = \underset{\epsilon \to 0}{\lim} \underset{\mathbb{R}^1\setminus (-\epsilon, \epsilon)}{\int} \frac{1}{x} b(x) d x \\ & = PV\left( \frac{b(x)}{x} \right) \end{aligned}

and

\begin{aligned} (B) & = \tfrac{1}{\pi} \underset{\epsilon \to 0}{\lim} \underset{\mathbb{R}^1}{\int} \frac{\epsilon}{x^2 + \epsilon^2} b(x) \, d x \\ & = \tfrac{1}{\pi} \underset{\epsilon \to 0}{\lim} \underset{\mathbb{R}^1}{\int} \left( \frac{d}{d x} \arctan\left( \frac{x}{\epsilon} \right) \right) b(x) \, d x \\ & = - \tfrac{1}{\pi} \underset{\epsilon \to 0}{\lim} \underset{\mathbb{R}^1}{\int} \arctan\left( \frac{x}{\epsilon} \right) \frac{d b}{d x}(x) \, d x \\ & = - \frac{1}{2} \underset{\mathbb{R}^1}{\int} sgn(x) \frac{d b}{d x}(x) \, d x \\ & = b(0) \end{aligned}

where we used that the derivative of the arctan function is $\frac{d}{ d x} \arctan(x) = 1/(1 + x^2)$ and that $\underset{\epsilon \to + \infty}{\lim} \arctan(x/\epsilon) = \tfrac{\pi}{2}sgn(x)$ is proportional to the sign function.

Example

(Fourier integral formula for step function)

The Heaviside distribution $\Theta \in \mathcal{D}'(\mathbb{R})$ is equivalently the following Cauchy principal value (def. ):

\begin{aligned} \Theta(x) & = \frac{1}{2\pi i} \int_{-\infty}^\infty \frac{e^{i \omega x}}{\omega - i 0^+} \\ & \coloneqq \underset{ \epsilon \to 0^+}{\lim} \frac{1}{2 \pi i} \int_{-\infty}^\infty \frac{e^{i \omega x}}{\omega - i \epsilon} d\omega \,, \end{aligned}

where the limit is taken over sequences of positive real numbers $\epsilon \in (-\infty,0)$ tending to zero.

Proof

We may think of the integrand $\frac{e^{i \omega x}}{\omega - i \epsilon}$ uniquely extended to a holomorphic function on the complex plane and consider computing the given real line integral for fixed $\epsilon$ as a contour integral in the complex plane.

If $x \in (0,\infty)$ is positive, then the exponent

i \omega x = - Im(\omega) x + i Re(\omega) x

has negative real part for positive imaginary part of $\omega$ . This means that the line integral equals the complex contour integral over a contour $C_+ \subset \mathbb{C}$ closing in the upper half plane. Since $i \epsilon$ has positive imaginary part by construction, this contour does encircle the pole of the integrand $\frac{e^{i \omega x}}{\omega - i \epsilon}$ at $\omega = i \epsilon$ . Hence by the Cauchy integral formula in the case $x \gt 0$ one gets

\begin{aligned} \underset{\epsilon \to 0^+}{\lim} \frac{1}{2 \pi i} \int_{-\infty}^\infty \frac{e^{i \omega x}}{\omega - i \epsilon} d\omega & = \underset{\epsilon \to 0^+}{\lim} \frac{1}{2 \pi i} \oint_{C_+} \frac{e^{i \omega x}}{\omega - i \epsilon} d \omega \\ & = \underset{\epsilon \to 0^+}{\lim} \left(e^{i \omega x}\vert_{\omega = i \epsilon}\right) \\ & = \underset{\epsilon \to 0^+}{\lim} e^{- \epsilon x} \\ & = e^0 = 1 \end{aligned} \,.

Conversely, for $x \lt 0$ the real part of the integrand decays as the negative imaginary part increases, and hence in this case the given line integral equals the contour integral for a contour $C_- \subset \mathbb{C}$ closing in the lower half plane. Since the integrand has no pole in the lower half plane, in this case the Cauchy integral formula says that this integral is zero.

Conversely, by the Fourier inversion theorem, the Fourier transform of the Heaviside distribution is the Cauchy principal value as in prop. :

Example

(relation to Fourier transform of Heaviside distribution / Schwinger parameterization)

The Fourier transform of distributions (def. ) of the Heaviside distribution is the following Cauchy principal value:

\begin{aligned} \widehat \Theta(x) & = \int_0^\infty e^{i k x} \, dk \\ & = i \frac{1}{x + i 0^+} \end{aligned}

Here the second equality is also known as complex Schwinger parameterization.

Proof

As generalized functions consider the limit with a decaying component:

\begin{aligned} \int_0^\infty e^{i k x} \, dk & = \underset{\epsilon \to 0^+}{\lim} \int_0^\infty e^{i k x - \epsilon k} \, dk \\ & = - \underset{\epsilon \to 0^+}{\lim} \frac{1}{ i x - \epsilon} \\ & = i \frac{1}{x + i 0^+} \end{aligned}

Let now $q \colon \mathbb{R}^{n} \to \mathbb{R}$ be a non-degenerate real quadratic form analytically continued to a real quadratic form

q \;\colon\; \mathbb{C}^n \longrightarrow \mathbb{C} \,.

Write $\Delta$ for the determinant of $q$

Write $q^\ast$ for the induced quadratic form on dual vector space. Notice that $q$ (and hence $a^\ast$ ) are assumed non-degenerate but need not necessarily be positive or negative definite.

Proposition

(Fourier transform of principal value of power of quadratic form)

Let $m \in \mathbb{R}$ be any real number, and $\kappa \in \mathbb{C}$ any complex number. Then the Fourier transform of distributions of $1/(q + m^2 + i 0^+)^\kappa$ is

\widehat { \left( \frac{1}{(q + m^2 + i0^+)^\kappa} \right) } \;=\; \frac{ 2^{1- \kappa} (\sqrt{2\pi})^{n} m^{n/2-\kappa} } { \Gamma(\kappa) \sqrt{\Delta} } \frac{ K_{n/2 - \kappa}\left( m \sqrt{q^\ast - i 0^+} \right) } { \left(\sqrt{q^\ast - i0^+ }\right)^{n/2 - \kappa} } \,,

where

$\Gamma$ deotes the Gamma function
$K_{\nu}$ denotes the modified Bessel function of order $\nu$ .

Notice that $K_\nu(a)$ diverges for $a \to 0$ as $a^{-\nu}$ (DLMF 10.30.2).

(Gel’fand-Shilov 66, III 2.8 (8) and (9), p 289)

Proposition

(Fourier transform of delta distribution applied to mass shell)

Let $m \in \mathbb{R}$ , then the Fourier transform of distributions of the delta distribution $\delta$ applied to the “mass shell” $q + m^2$ is

\widehat{ \delta(q + m^2) } \;=\; - \frac{i}{\sqrt{{\vert\Delta\vert}}} \left( e^{i \pi t /2 } \frac{ K_{n/2-1} \left( m \sqrt{ q^\ast + i0^+ } \right) }{ \left(\sqrt{q^\ast + i0^+}\right)^{n/2 - 1} } \;-\; e^{-i \pi t /2 } \frac{ K_{n/2-1} \left( m \sqrt{ q^\ast - i0^+ } \right) }{ \left(\sqrt{q^\ast - i0^+}\right)^{n/2 - 1} } \right) \,,

where $K_\nu$ denotes the modified Bessel function of order $\nu$ .

Notice that $K_\nu(a)$ diverges for $a \to 0$ as $a^{-\nu}$ (DLMF 10.30.2).

(Gel’fand-Shilov 66, III 2.11 (7), p 294)

$\,$

propagators for the free scalar field on Minkowski spacetime

Advanced and regarded propagators
Causal propagator
Wightman propagator
Feynman propagator
Singular support and Wave front sets

$\,$

On Minkowski spacetime $\mathbb{R}^{p,1}$ consider the Klein-Gordon operator (example )

\eta^{\mu \nu} \frac{\partial}{\partial x^\mu} \frac{\partial}{\partial x^\nu} \Phi - \left( \tfrac{m c}{\hbar} \right)^2 \Phi \;=\; 0 \,.

By example its Fourier transform is

- k_\mu k^\mu - \left( \tfrac{m c}{\hbar} \right)^2 \;=\; (k_0)^2 - {\vert \vec k\vert}^2 - \left( \tfrac{m c}{\hbar} \right)^2 \,.

The dispersion relation of this equation we write (see def. )

(134)

\omega(\vec k) \;\coloneqq\; + c \sqrt{ {\vert \vec k \vert}^2 + \left( \tfrac{m c}{\hbar}\right)^2 } \,,

where on the right we choose the non-negative square root.

$\,$

advanced and retarded propagators for Klein-Gordon equation on Minkowski spacetime

Proposition

(mode expansion of advanced and retarded propagators for Klein-Gordon operator on Minkowski spacetime)

The advanced and retarded Green functions $G_\pm$ (def. ) of the Klein-Gordon operator on Minkowski spacetime (example ) are induced from integral kernels (“propagators”), hence distributions in two variables

\Delta_\pm \in \mathcal{D}'(\mathbb{R}^{p,1}\times \mathbb{R}^{p,1})

by (in generalized function-notation, prop. )

G_\pm(\Phi) \;=\; \underset{\mathbb{R}^{p,1}}{\int} \Delta_{\pm}(x,y) \Phi(y) \, dvol(y)

where the advanced and retarded propagators $\Delta_{\pm}(x,y)$ have the following equivalent expressions:

(135)

\begin{aligned} \Delta_\pm(x-y) & = \frac{1}{(2\pi)^{p+1}} \underset{ {\epsilon \in (0,\infty)} \atop {\epsilon \to 0} }{\lim} \int \int \frac{ e^{i k_0 (x^0 - y^0)} e^{i \vec k \cdot (\vec x - \vec y)} }{ (k_0 \mp i\epsilon)^2 - {\vert \vec k\vert}^2 -\left( \tfrac{m c}{\hbar}\right)^2 } \, d k_0 \, d^p \vec k \\ & = \left\{ \array{ \frac{\pm i}{(2\pi)^{p}} \int \frac{1}{2\omega(\vec k)/c} \left( e^{+i \omega(\vec k)(x^0 - y^0)/c + i \vec k \cdot (\vec x -\vec y)} - e^{-i \omega(\vec k)(x^0 - y^0)/c + i \vec k \cdot (\vec x - \vec y) } \right) d^p \vec k & \vert & \text{if} \, \pm (x^0 - y^0) \gt 0 \\ 0 & \vert & \text{otherwise} } \right. \\ & = \left\{ \array{ \frac{\mp 1}{(2\pi)^{p}} \int \frac{1}{\omega(\vec k)/c} \sin\left( \omega(\vec k)(x^0 - y^0)/c \right) e^{i \vec k \cdot (\vec x - \vec y) } d^p \vec k & \vert & \text{if} \, \pm (x^0 - y^0) \gt 0 \\ 0 & \vert & \text{otherwise} } \right. \end{aligned}

Here $\omega(\vec k)$ denotes the dispersion relation (134) of the Klein-Gordon equation.

Proof

The Klein-Gordon operator is a Green hyperbolic differential operator (example ) therefore its advanced and retarded Green functions exist uniquely (prop. ). Moreover, prop. says that they are continuous linear functionals with respect to the topological vector space structures on spaces of smooth sections (def. ). In the case of the Klein-Gordon operator this just means that

G_{\pm} \;\colon\; C^\infty_{cp}(\mathbb{R}^{p,1}) \longrightarrow C^\infty_{\pm cp}(\mathbb{R}^{p,1})

are continuous linear functionals in the standard sense of distributions. Therefore the Schwartz kernel theorem implies the existence of integral kernels being distributions in two variables

\Delta_{\pm} \in \mathcal{D}(\mathbb{R}^{p,1} \times \mathbb{R}^{p,1})

such that, in the notation of generalized functions,

(G_\pm \alpha)(x) \;=\; \underset{\mathbb{R}^{p,1}}{\int} \Delta_{\pm}(x,y) \alpha(y) \, dvol(y) \,.

These integral kernels are the advanced/retarded “propagators”. We now compute these integral kernels by making an Ansatz and showing that it has the defining properties, which identifies them by the uniqueness statement of prop. .

We make use of the fact that the Klein-Gordon equation is invariant under the defnining action of the Poincaré group on Minkowski spacetime, which is a semidirect product group of the translation group and the Lorentz group.

Since the Klein-Gordon operator is invariant, in particular, under translations in $\mathbb{R}^{p,1}$ it is clear that the propagators, as a distribution in two variables, depend only on the difference of its two arguments

(136)

\Delta_{\pm}(x,y) = \Delta_{\pm}(x-y) \,.

Since moreover the Klein-Gordon operator is formally self-adjoint (this prop.) this implies that for $P$ the Klein the equation (93)

P \circ G_\pm = id

is equivalent to the equation (92)

G_\pm \circ P = id \,.

Therefore it is sufficient to solve for the first of these two equation, subject to the defining support conditions. In terms of the propagator integral kernels this means that we have to solve the distributional equation

(137)

\left( \eta^{\mu \nu} \frac{\partial}{\partial x^\mu} \frac{\partial}{\partial x^\nu} - \left( \tfrac{m c}{\hbar} \right)^2 \right) \Delta_\pm(x-y) \;=\; \delta(x-y)

subject to the condition that the distributional support (def. ) is

supp\left( \Delta_{\pm}(x-y) \right) \subset \left\{ {\vert x-y\vert^2_\eta}\lt 0 \;\,,\; \pm(x^0 - y^ 0) \gt 0 \right\} \,.

We make the Ansatz that we assume that $\Delta_{\pm}$ , as a distribution in a single variable $x-y$ , is a tempered distribution

\Delta_\pm \in \mathcal{S}'(\mathbb{R}^{p,1}) \,,

hence amenable to Fourier transform of distributions (def. ). If we do find a solution this way, it is guaranteed to be the unique solution by prop. .

By example the distributional Fourier transform of equation (137) is

(138)

\begin{aligned} \left( - \eta^{\mu \nu} k_\mu k_\nu - \left( \tfrac{m c}{\hbar} \right)^2 \right) \widehat{\Delta_{\pm}}(k) & = \widehat{\delta}(k) \\ & = 1 \end{aligned} \,,

where in the second line we used the Fourier transform of the delta distribution from example .

Notice that this implies that the Fourier transform of the causal propagator (95)

\Delta_S \coloneqq \Delta_+ - \Delta_-

satisfies the homogeneous equation:

(139)

\left( - \eta^{\mu \nu} k_\mu k_\nu - \left( \tfrac{m c}{\hbar} \right)^2 \right) \widehat{\Delta_S}(k) \;=\; 0 \,,

Hence we are now reduced to finding solutions $\widehat{\Delta_\pm} \in \mathcal{S}'(\mathbb{R}^{p,1})$ to (138) such that their Fourier inverse $\Delta_\pm$ has the required support properties.

We discuss this by a variant of the Cauchy principal value:

Suppose the following limit of non-singular distributions in the variable $k \in \mathbb{R}^{p,1}$ exists in the space of distributions

(140)

\underset{ {\epsilon \in (0,\infty)} \atop { \epsilon \to 0 } }{\lim} \frac{1}{ (k_0 \mp i \epsilon)^2 - {\vert \vec k\vert^2} - \left( \tfrac{m c}{\hbar} \right)^2 } \;\in\; \mathcal{D}'(\mathbb{R}^{p,1})

meaning that for each bump function $b \in C^\infty_{cp}(\mathbb{R}^{p,1})$ the limit in $\mathbb{C}$

\underset{ {\epsilon \in (0,\infty)} \atop { \epsilon \to 0 } }{\lim} \underset{\mathbb{R}^{p,1}}{\int} \frac{b(k)}{ (k_0\mp i \epsilon)^2 - {\vert \vec k\vert}^2 - \left( \tfrac{m c}{\hbar} \right)^2 } d^{p+1}k \;\in\; \mathbb{C}

exists. Then this limit is clearly a solution to the distributional equation (138) because on those bump functions $b(k)$ which happen to be products with $\left(-\eta^{\mu \nu}k_\mu k^\nu - \left( \tfrac{m c}{\hbar}\right)^2\right)$ we clearly have

(141)

\begin{aligned} \underset{ {\epsilon \in (0,\infty)} \atop { \epsilon \to 0 } }{\lim} \underset{\mathbb{R}^{p,1}}{\int} \frac{ \left( -\eta^{\mu \nu} k_\mu k_\nu - \left( \tfrac{m c}{\hbar} \right)^2 \right) b(k) }{ (k_0\mp i \epsilon)^2 - {\vert \vec k\vert}^2 - \left( \tfrac{m c}{\hbar} \right)^2 } d^{p+1}k & = \underset{\mathbb{R}^{p,1}}{\int} \underset{= 1}{ \underbrace{ \underset{ {\epsilon \in (0,\infty)} \atop { \epsilon \to 0 } }{\lim} \frac{ \left( -\eta^{\mu \nu} k_\mu k_\nu - \left( \tfrac{m c}{\hbar} \right)^2 \right) }{ (k_0\mp i \epsilon)^2 - {\vert \vec k\vert}^2 - \left( \tfrac{m c}{\hbar} \right)^2 } } } b(k)\, d^{p+1}k \\ & = \langle 1, b\rangle \,. \end{aligned}

Moreover, if the limiting distribution (140) exists, then it is clearly a tempered distribution, hence we may apply Fourier inversion to obtain Green functions

(142)

\Delta_{\pm}(x,y) \;\coloneqq\; \underset{ {\epsilon \in (0,\infty)} \atop {\epsilon \to 0} }{\lim} \frac{1}{(2\pi)^{p+1}} \underset{\mathbb{R}^{p,1}}{\int} \frac{e^{i k_\mu (x-y)^\mu}}{ (k_0 \mp i \epsilon )^2 - {\vert \vec k\vert}^2 - \left(\tfrac{m c}{\hbar}\right)^2 } d k_0 d^p \vec k \,.

To see that this is the correct answer, we need to check the defining support property.

Finally, by the Fourier inversion theorem, to show that the limit (140) indeed exists it is sufficient to show that the limit in (142) exists.

We compute as follows

(143)

\begin{aligned} \Delta_\pm(x-y) & = \frac{1}{(2\pi)^{p+1}} \underset{ {\epsilon \in (0,\infty)} \atop {\epsilon \to 0} }{\lim} \int \int \frac{ e^{i k_0 (x^0 - y^0)} e^{i \vec k \cdot (\vec x - \vec y)} }{ (k_0 \mp i\epsilon)^2 - {\vert \vec k\vert}^2 -\left( \tfrac{m c}{\hbar}\right)^2 } \, d k_0 \, d^p \vec k \\ & = \frac{1}{(2\pi)^{p+1}} \underset{ {\epsilon \in (0,\infty)} \atop {\epsilon \to 0} }{\lim} \int \int \frac{ e^{i k_0 (x^0 - y^0)} e^{i \vec k \cdot (\vec x - \vec y)} }{ (k_0 \mp i \epsilon)^2 - \left(\omega(\vec k)/c\right)^2 } \, d k_0 \, d^p \vec k \\ &= \frac{1}{(2\pi)^{p+1}} \underset{ {\epsilon \in (0,\infty)} \atop {\epsilon \to 0} }{\lim} \int \int \frac{ e^{i k_0 (x^0 - y^0)} e^{i \vec k \cdot (\vec x - \vec y)} }{ \left( (k_0 \mp i\epsilon) - \omega(\vec k)/c \right) \left( (k_0 \mp i \epsilon) + \omega(\vec k)/c \right) } \, d k_0 \, d^p \vec k \\ & = \left\{ \array{ \frac{\pm i}{(2\pi)^{p}} \int \frac{1}{2\omega(\vec k)/c} \left( e^{i \omega(\vec k)(x^0 - y^0)/c + i \vec k \cdot (\vec x -\vec y)} - e^{-i \omega(\vec k)(x^0 - y^0)/c + i \vec k \cdot (\vec x - \vec y)} \right) d^p \vec k & \vert & \text{if} \, \pm (x^0 - y^0) \gt 0 \\ 0 & \vert & \text{otherwise} } \right. \\ & = \left\{ \array{ \frac{\mp 1}{(2\pi)^{p}} \int \frac{1}{\omega(\vec k)/c} \sin\left( \omega(\vec k)(x^0 - y^0)/c \right) e^{i \vec k \cdot (\vec x - \vec y) } d^p \vec k & \vert & \text{if} \, \pm (x^0 - y^0) \gt 0 \\ 0 & \vert & \text{otherwise} } \right. \end{aligned}

where $\omega(\vec k)$ denotes the dispersion relation (134) of the Klein-Gordon equation. The last step is simply the application of Euler's formula $\sin(\alpha) = \tfrac{1}{2 i }\left( e^{i \alpha} - e^{- i \alpha}\right)$ .

Here the key step is the application of Cauchy's integral formula in the fourth step. We spell this out now for $\Delta_+$ , the discussion for $\Delta_-$ is the same, just with the appropriate signs reversed.

If $(x^0 - y^0) \gt 0$ thn the expression $e^{ik_0 (x^0 - y^0)}$ decays with positive imaginary part of $k_0$ , so that we may expand the integration domain into the upper half plane as

\begin{aligned} \int_{-\infty}^\infty d k_0 & = \phantom{+} \int_{-\infty}^0 d k_0 + \int_{0}^{+ i \infty} d k_0 \\ & = + \int_{+i \infty}^0 d k_0 + \int_0^\infty d k_0 \,; \end{aligned}

Conversely, if $(x^0 - y^0) \lt 0$ then we may analogously expand into the lower half plane.

This integration domain may then further be completed to two contour integrations. For the expansion into the upper half plane these encircle counter-clockwise the poles at $\pm \omega(\vec k)+ i\epsilon \in \mathbb{C}$ , while for expansion into the lower half plane no poles are being encircled.

Apply Cauchy's integral formula to find in the case $(x^0 - y^0)\gt 0$ the sum of the residues at these two poles times $2\pi i$ , zero in the other case. (For the retarded propagator we get $- 2 \pi i$ times the residues, because now the contours encircling non-trivial poles go clockwise).
The result is now non-singular at $\epsilon = 0$ and therefore the limit $\epsilon \to 0$ is now computed by evaluating at $\epsilon = 0$ .

This computation shows a) that the limiting distribution indeed exists, and b) that the support of $\Delta_+$ is in the future, and that of $\Delta_-$ is in the past.

Hence it only remains to see now that the support of $\Delta_\pm$ is inside the causal cone. But this follows from the previous argument, by using that the Klein-Gordon equation is invariant under Lorentz transformations: This implies that the support is in fact in the future of every spacelike slice through the origin in $\mathbb{R}^{p,1}$ , hence in the closed future cone of the origin.

Proposition

(causal propagator is skew-symmetric)
Under reversal of arguments the advanced and retarded causal propagators from prop. are related by

(144)

\Delta_{\pm}(y-x) = \Delta_\mp(x-y) \,.

It follows that the causal propagator (95) $\Delta \coloneqq \Delta_+ - \Delta_-$ is skew-symmetric in its arguments:

\Delta_S(x-y) = - \Delta_S(y-x) \,.

Proof

By prop. we have with (135)

\begin{aligned} \Delta_\pm(y-x) & = \left\{ \array{ \frac{\pm i}{(2\pi)^{p}} \int \frac{1}{2\omega(\vec k)/c} \left( e^{-i \omega(\vec k)(x^0 - y^0)/c - i \vec k \cdot (\vec x -\vec y)} - e^{+i \omega(\vec k)(x^0 - y^0)/c - i \vec k \cdot (\vec x - \vec y) } \right) d^p \vec k & \vert & \text{if} \, \mp (x^0 - y^0) \gt 0 \\ 0 & \vert & \text{otherwise} } \right. \\ & = \left\{ \array{ \frac{\pm i}{(2\pi)^{p}} \int \frac{1}{2\omega(\vec k)/c} \left( e^{-i \omega(\vec k)(x^0 - y^0)/c + i \vec k \cdot (\vec x -\vec y)} - e^{+i \omega(\vec k)(x^0 - y^0)/c - i \vec k \cdot (\vec x - \vec y) } \right) d^p \vec k & \vert & \text{if} \, \mp (x^0 - y^0) \gt 0 \\ 0 & \vert & \text{otherwise} } \right. \\ & = \left\{ \array{ \frac{\mp i}{(2\pi)^{p}} \int \frac{1}{2\omega(\vec k)/c} \left( e^{+i \omega(\vec k)(x^0 - y^0)/c + i \vec k \cdot (\vec x -\vec y)} - e^{-i \omega(\vec k)(x^0 - y^0)/c - i \vec k \cdot (\vec x - \vec y) } \right) d^p \vec k & \vert & \text{if} \, \mp (x^0 - y^0) \gt 0 \\ 0 & \vert & \text{otherwise} } \right. \\ & = \Delta_\mp(x-y) \end{aligned}

Here in the second step we applied change of integration variables $\vec k \mapsto - \vec k$ (which introduces no sign because in addition to $d \vec k \mapsto - d \vec k$ the integration domain reverses orientation).

$\,$

causal propagator

Proposition

(mode expansion of causal propagator for Klein-Gordon equation on Minkowski spacetime)

The causal propagator (95) for the Klein-Gordon equation for mass $m$ on Minkowski spacetime $\mathbb{R}^{p,1}$ (example ) is given, in generalized function notation, by

(145)

\begin{aligned} \Delta_S(x,y) & = \frac{+ i}{(2\pi)^{p}} \int \frac{1}{2\omega(\vec k)/c} \left( e^{i \omega(\vec k)(x^0 - y^0)/c + i \vec k \cdot (\vec x -\vec y)} - e^{-i \omega(\vec k)(x^0 - y^0)/c + i \vec k \cdot (\vec x - \vec y)} \right) d^p \vec k \\ & = \frac{-1}{(2\pi)^p} \int \frac{1}{\omega(\vec k)/c} \sin\left( \omega(\vec k)(x^0 - y^0)/c \right) e^{i \vec k \cdot (\vec x -\vec y)} d^p \vec k \,, \end{aligned}

where in the second line we used Euler's formula $sin(\alpha)= \tfrac{1}{2i}\left( e^{i \alpha} - e^{-i \alpha} \right)$ .

In particular this shows that the causal propagator is real, in that it is equal to its complex conjugate

(146)

\left(\Delta_S(x,y)\right)^\ast = \Delta_S(x,y) \,.

Proof

By definition and using the expression from prop. for the advanced and retarded causal propagators we have

\begin{aligned} \Delta_S(x,y) & \coloneqq \Delta_+(x,y) - \Delta_-(x,y) \\ & = \left\{ \array{ \frac{+ i}{(2\pi)^{p}} \int \frac{1}{2\omega(\vec k)/c} \left( e^{i \omega(\vec k)(x^0 - y^0)/c + i \vec k \cdot (\vec x -\vec y)} - e^{-i \omega(\vec k)(x^0 - y^0)/c + i \vec k \cdot (\vec x - \vec y)} \right) d^p \vec k & \vert & \text{if} \, + (x^0 - y^0) \gt 0 \\ \frac{(-1) (-1) i}{(2\pi)^{p}} \int \frac{1}{2\omega(\vec k)/c} \left( e^{i \omega(\vec k)(x^0 - y^0)/c + i \vec k \cdot (\vec x -\vec y)} - e^{-i \omega(\vec k)(x^0 - y^0)/c + i \vec k \cdot (\vec x - \vec y)} \right) d^p \vec k & \vert & \text{if} \, - (x^0 - y^0) \gt 0 } \right. \\ & = \frac{+ i}{(2\pi)^{p}} \int \frac{1}{2\omega(\vec k)/c} \left( e^{i \omega(\vec k)(x^0 - y^0)/c + i \vec k \cdot (\vec x -\vec y)} - e^{-i \omega(\vec k)(x^0 - y^0)/c + i \vec k \cdot (\vec x - \vec y)} \right) d^p \vec k \\ & = \frac{-1}{(2\pi)^p} \int \frac{1}{\omega(\vec k)/c} \sin\left( \omega(\vec k)(x^0 - y^0)/c \right) e^{i \vec k \cdot (\vec x -\vec y)} d^p \vec k \end{aligned}

For the reality, notice from the last line that

\begin{aligned} \left(\Delta_S(x,y)\right)^\ast & = \frac{-1}{(2\pi)^p} \int \frac{1}{\omega(\vec k)/c} \sin\left( \omega(\vec k)(x^0 - y^0)/c \right) e^{-i \vec k \cdot (\vec x -\vec y)} d^p \vec k \\ & = \frac{-1}{(2\pi)^p} \int \frac{1}{\omega(\vec k)/c} \sin\left( \omega(\vec k)(x^0 - y^0)/c \right) e^{+i \vec k \cdot (\vec x -\vec y)} d^p \vec k \\ & = \Delta_S(x,y) \,, \end{aligned}

where in the last step we used the change of integration variables $\vec k \mapsto - \vec k$ (whih introduces no sign, since on top of $d \vec k \mapsto - d \vec k$ the orientation of the integration domain changes).

We consider a couple of equivalent expressions for the causal propagator which are useful for computations:

Proposition

(causal propagator for Klein-Gordon operator on Minkowski spacetime as a contour integral)

The causal propagator (prop. ) for the Klein-Gordon equation at mass $m$ on Minkowski spacetime (example ) has the following equivalent expression, as a generalized function, given as a contour integral along a Jordan curve $C(\vec k)$ going counter-clockwise around the two poles at $k_0 = \pm \omega(\vec k)/c$ :

\Delta_S(x,y) \;=\; (2\pi)^{-(p+1)} \int \underset{C(\vec k)}{\oint} \frac{e^{i k_\mu (x-y)^\mu}}{ -k_\mu k^\mu - \left( \tfrac{m c}{\hbar} \right)^2g } \,d k_0 \,d^{p} k \,.

graphics grabbed from Kocic 16

Proof

By Cauchy's integral formula we compute as follows:

\begin{aligned} (2\pi)^{-(p+1)} \int \underset{C(\vec k)}{\oint} \frac{e^{i k_\mu (x^\mu - y^\mu)}}{ -k_\mu k^\mu - \left( \tfrac{m c}{\hbar} \right)^2 } \,d k_0 \,d^{p} k & = (2\pi)^{-(p+1)} \int \underset{C(\vec k)}{\oint} \frac{ e^{i k_0 x^0} e^{ i \vec k \cdot (\vec x - \vec y)} }{ k_0^2 - \omega(\vec k)^2/c^2 } \,d k_0 \,d^p \vec k \\ & = (2\pi)^{-(p+1)} \int \underset{C(\vec k)}{\oint} \frac{ e^{i k_0 (x^0 - y^0)} e^{i \vec k \cdot (\vec x - \vec y)} }{ ( k_0 + \omega(\vec k)/c ) ( k_0 - \omega(\vec k)/c ) } \,d k_0 \,d^p \vec k \\ & = (2\pi)^{-(p+1)} 2\pi i \int \left( \frac{ e^{i \omega(\vec k) (x^0 - y^0)/c} e^{i \vec k \cdot (\vec x - \vec y)} } { 2 \omega(\vec k)/c } - \frac{ e^{ - i \omega(\vec k) (x^0 - y^0)/c} e^{i \vec k \cdot (\vec x - \vec y)} }{ 2 \omega(\vec k)/c } \right) \,d^p \vec k \\ & = i (2\pi)^{-p} \int \frac{1}{\omega(\vec k)/c} sin\left( \omega(\vec k)(x^0 - y^0)/c \right) e^{i \vec k \cdot (\vec x - \vec y)} \,d^p \vec k \,. \end{aligned}

The last line is the expression for the causal propagator from prop.

Proposition

(causal propagator as Fourier transform of delta distribution on the Fourier transformed Klein-Gordon operator)

The causal propagator for the Klein-Gordon equation at mass $m$ on Minkowski spacetime has the following equivalent expression, as a generalized function:

\Delta_S(x,y) \;=\; i (2\pi)^{-p} \int \delta\left( k_\mu k^\mu + \left( \tfrac{m c}{\hbar}\right)^2 \right) sgn( k_0 ) e^{ i k_\mu (x-y)^\mu } d^{p+1} k \,,

where the integrand is the product of the sign function of $k_0$ with the delta distribution of the Fourier transform of the Klein-Gordon operator and a plane wave factor.

Proof

By decomposing the integral over $k_0$ into its negative and its positive half, and applying the change of integration variables $k_0 = \pm\sqrt{h}$ we get

\begin{aligned} i (2\pi)^{-p} \int \delta\left( k_\mu k^\mu + \left( \tfrac{m c}{\hbar}\right)^2 \right) sgn( k_0 ) e^{ i k_\mu (x-y)^\mu } d^{p+1} k & = + i (2\pi)^{-p} \int \int_0^\infty \delta\left( -k_0^2 + \vec k^2 + \left( \tfrac{m c}{\hbar}\right)^2 \right) e^{ i k_0 (x^0 - y^0) + i \vec k \cdot (\vec x - \vec y)} d k_0 \, d^p \vec k \\ & \phantom{=} - i (2\pi)^{-p} \int \int_{-\infty}^0 \delta\left( -k_0^2 + \vec k^2 + \left(\tfrac{m c}{\hbar}\right)^2 \right) e^{ i k_0 (x^0 - y^0)+ i \vec k \cdot (\vec x - \vec y) } d k_0 \, d^{p} \vec k \\ & = +i (2\pi)^{-p} \int \int_0^\infty \frac{1}{2 \sqrt{h}} \delta\left( -h + \omega(\vec k)^2/c^2 \right) e^{ + i \sqrt{h} (x^0 - y^0) + i \vec k \cdot \vec x } d h \, d^{p} \vec k \\ & \phantom{=} - i (2\pi)^{-p} \int \int_0^\infty \frac{1}{2 \sqrt{h}} \delta\left( - h + \omega(\vec k)^2/c^2 \right) e^{ - i \sqrt{h} (x^0 - y^0) + i \vec k \cdot \vec x } d h \, d^{p} \vec k \\ & = +i (2\pi)^{-p} \int \frac{1}{2 \omega(\vec k)/c} e^{ i \omega(\vec k) (x-y)^0/c + i \vec k \cdot \vec x} d^{p} \vec k \\ & \phantom{=} - i (2\pi)^{-p} \int \frac{1}{2 \omega(\vec k)/c} e^{ - i \omega(\vec k) (x-y)^0/c + i \vec k \cdot \vec x } d^{p} \vec k \\ & = -(2 \pi)^{-p} \int \frac{1}{\omega(\vec k)/c} sin\left( \omega(\vec k)(x-y)^0/c \right) e^{i \vec k \cdot (\vec x - \vec y)} \end{aligned}

The last line is the expression for the causal propagator from prop. .

$\,$

Wightman propagator

Prop. exhibits the causal propagator of the Klein-Gordon operator on Minkowski spacetime as the difference of a contribution for positive temporal angular frequency $k_0 \propto \omega(\vec k)$ (hence positive energy $\hbar \omega(\vec k)$ and a contribution of negative temporal angular frequency.

The positive frequency contribution to the causal propagator is called the Wightman propagator (def. below), also known as the the vacuum state 2-point function of the free real scalar field on Minkowski spacetime. Notice that the temporal component of the wave vector is proportional to the negative angular frequency

k_0 = -\omega/c

(see at plane wave), therefore the appearance of the step function $\Theta(-k_0)$ in (147) below:

Definition

(Wightman propagator or vacuum state 2-point function for Klein-Gordon operator on Minkowski spacetime)

The Wightman propagator for the Klein-Gordon operator at mass $m$ on Minkowski spacetime (example ) is the tempered distribution in two variables $\Delta_H \in \mathcal{S}'(\mathbb{R}^{p,1})$ which as a generalized function is given by the expression

(147)

\begin{aligned} \Delta_H(x,y) & \coloneqq \frac{1}{(2\pi)^p} \int \delta\left( k_\mu k^\mu + m^2 \right) \Theta( -k_0 ) e^{i k_\mu (x^\mu-y^\mu) } \, d^{p+1} k \\ & = \frac{1}{(2\pi)^p} \int \frac{1}{2 \omega(\vec k)/c} e^{-i \omega(\vec k)(x^0 - y^0)/c + i \vec k \cdot (\vec x - \vec y) } \, d^p \vec k \,, \end{aligned}

Here in the first line we have in the integrand the delta distribution of the Fourier transform of the Klein-Gordon operator times a plane wave and times the step function $\Theta$ of the temporal component of the wave vector. In the second line we used the change of integration variables $k_0 = \sqrt{h}$ , then the definition of the delta distribution and the fact that $\omega(\vec k)$ is by definition the non-negative solution to the Klein-Gordon dispersion relation.

(e.g. Khavkine-Moretti 14, equation (38) and section 3.4)

Proposition

(Wightman propagator on Minkowski spacetime is distributional solution to Klein-Gordon equation)

The Wightman propagator $\Delta_H$ (def. ) is a distributional solution (def. ) to the Klein-Gordon equation

(\Box_x - m^2)\Delta_H(x,y) = 0 \,.

Proof

By definition the Wightman propagator is the Fourier transform of distributions of the product of distributions

\delta(k_\mu k^\mu + m^2) \Theta(-k_0) \,,

where in turn the argument of the delta distribution is just $-1$ times the Fourier transform of the [Klein-Gordon operator]] itself (prop. ). This is clearly a solution to the equation

(-k_\mu k^\mu - m^2) \, \delta(k_\mu k^\mu + m^2) \Theta(-k_0) \;=\; 0 \,.

Under Fourier inversion (prop. ), this is the equation $(\Box_x - m^2)\Delta_H(x,y) = 0$ , as in the proof of prop. .

Proposition

(contour integral representation of the Wightman propagator for the Klein-Gordon operator on Minkowski spacetime)

The Wightman propagator from def. is equivalently given by the contour integral

(148)

\Delta_H(x,y) \;=\; -i(2\pi)^{-(p+1)} \int \underset{C_+(\vec k)}{\oint} \frac{e^{-i k_\mu (x-y)^\mu}}{ -k_\mu k^\mu - \left( \tfrac{m c}{\hbar} \right)^2 } d k_0 d^{p} k \,,

where the Jordan curve $C_+(\vec k) \subset \mathbb{C}$ runs counter-clockwise, enclosing the point $+ \omega(\vec k)/c \in \mathbb{R} \subset \mathbb{C}$ , but not enclosing the point $- \omega(\vec k)/c \in \mathbb{R} \subset \mathbb{C}$ .

graphics grabbed from Kocic 16

Proof

We compute as follows:

\begin{aligned} -i(2\pi)^{-(p+1)} \int \underset{C_+(\vec k)}{\oint} \frac{e^{ - i k_\mu (x-y)^\mu}}{ -k_\mu k^\mu - \left( \tfrac{m c}{\hbar} \right)^2 } d k_0 d^{p} k & = -i(2\pi)^{-(p+1)} \int \oint_{C_+(\vec k)} \frac{ e^{ -i k_0 x^0} e^{i \vec k \cdot (\vec x - \vec y)} }{ k_0^2 - \omega(\vec k)^2/c^2 } d k_0 d^p \vec k \\ & = -i(2\pi)^{-(p+1)} \int \underset{C_+(\vec k)}{\oint} \frac{ e^{ - i k_0 (x^0-y^0)} e^{i \vec k \cdot (\vec x - \vec y)} }{ ( k_0 - \omega_\epsilon(\vec k) ) ( k_0 + \omega_\epsilon(\vec k) ) } d k_0 d^p \vec k \\ & = (2\pi)^{-p} \int \frac{1}{2 \omega(\vec k)} e^{-i \omega(\vec k) (x^0-y^0)/c} e^{i \vec k \cdot (\vec x - \vec y)} d^p \vec k \,. \end{aligned}

The last step is application of Cauchy's integral formula, which says that the contour integral picks up the residue of the pole of the integrand at $+ \omega(\vec k)/c \in \mathbb{R} \subset \mathbb{C}$ . The last line is $\Delta_H(x,y)$ , by definition .

Proposition

(skew-symmetric part of Wightman propagator is the causal propagator)

The Wightman propagator for the Klein-Gordon equation on Minkowski spacetime (def. ) is of the form

(149)

\begin{aligned} \Delta_H & = \tfrac{i}{2} \Delta_S + H \\ & = \tfrac{i}{2} \left( \Delta_+ - \Delta_- \right) + H \end{aligned} \,,

where

$\Delta_S$ is the causal propagator (prop. ), which is real (146) and skew-symmetric (prop. )

$(\Delta_S(x,y))^\ast = \Delta_S(x,y) \phantom{AA} \,, \phantom{AA} \Delta_S(y,x) = - \Delta_S(x,y)$
$H$ is real and symmetric

(150) $(H(x,y))^\ast = H(x,y) \phantom{AA} \,, \phantom{AA} H(y,x) = H(x,y)$

Proof

By applying Euler's formula to (147) we obtain

(151)

\begin{aligned} \Delta_H(x,y) & = \frac{1}{(2\pi)^p} \int \frac{1}{2 \omega(\vec k)/c} e^{-i \omega(\vec k)(x^0 - y^0)/c + i \vec k \cdot (\vec x - \vec y) } \, d^p \vec k \\ & = \tfrac{i}{2} \underset{= \Delta_S(x,y)}{ \underbrace{ \frac{-1}{(2\pi)^p} \int \frac{1}{\omega(\vec k)/c} \sin\left( \omega(\vec k)(x^0 - y^0)/c \right) e^{i \vec k \cdot (\vec x - \vec y) } \, d^p \vec k }} \;+\; \underset{ \coloneqq H(x,y) }{ \underbrace{ \frac{1}{(2\pi)^p} \int \frac{1}{2 \omega(\vec k)/c} \cos\left( \omega(\vec k)(x^0 - y^0)/c \right) e^{i \vec k \cdot (\vec x - \vec y) } \, d^p \vec k }} \end{aligned}

On the left this identifies the causal propagator by (145), prop. .

The second summand changes, both under complex conjugation as well as under $(x-y) \mapsto (y-x)$ , via change of integration variables $\vec k \mapsto - \vec k$ (because the cosine is an even function). This does not change the integral, and hence $H$ is symmetric.

$\,$

Feynman propagator

We have seen that the positive frequency component of the causal propagator $\Delta_S$ for the Klein-Gordon equation on Minkowski spacetime (prop. ) is the Wightman propagator $\Delta_H$ (def. ) given, according to prop. , by (149)

\begin{aligned} \Delta_H & = \tfrac{i}{2} \Delta_S + H \\ & = \tfrac{i}{2} \left( \Delta_+ - \Delta_- \right) + H \end{aligned} \,,

There is an evident variant of this combination, which will be of interest:

Definition

(Feynman propagator for Klein-Gordon equation on Minkowski spacetime)

The Feynman propagator for the Klein-Gordon equation on Minkowski spacetime (example ) is the linear combination

\Delta_F \coloneqq \tfrac{i}{2} \left( \Delta_+ + \Delta_- \right) + H

where the first term is proportional to the sum of the advanced and retarded propagators (prop. ) and the second is the symmetric part of the Wightman propagator according to prop. .

Similarly the anti-Feynman propagator is

\Delta_{\overline{F}} \coloneqq \tfrac{i}{2} \left( \Delta_+ + \Delta_- \right) - H \,.

It follows immediately that:

Proposition

(Feynman propagator is symmetric)

The Feynman propagator $\Delta_F$ and anti-Feynman propagator $\Delta_{\overline{F}}$ (def. ) are symmetric:

\Delta_F(x,y) = \Delta_F(y,x) \,.

Proof

By equation (144) in cor. we have that $\Delta_+ + \Delta_-$ is symmetric, and equation (150) in prop. says that $H$ is symmetric.

Proposition

(mode expansion for Feynman propagator of Klein-Gordon equation on Minkowski spacetime)

The Feynman propagator (def. ) for the Klein-Gordon equation on Minkowski spacetime is given by the following equivalent expressions

\begin{aligned} \Delta_F(x,y) & = \left\{ \array{ \frac{1}{(2\pi)^p} \int \frac{1}{\omega(\vec k)c} e^{-i\omega(\vec k)(x^0 - y^0)/c} e^{i \vec k \cdot (\vec x - \vec y)} \, d^p \vec k &\vert& (x^0 - y^0) \gt 0 \\ \frac{1}{(2\pi)^p} \int \frac{1}{2\omega(\vec k)c} e^{+i\omega(\vec k)(x^0 - y^0)/c} e^{i \vec k \cdot (\vec x - \vec y)} \, d^p \vec k &\vert& (x^0 - y^0) \lt 0 } \right. \\ & = \left\{ \array{ \Delta_H(x,y) &\vert& (x^0 - y^0) \gt 0 \\ \Delta_H(y,x) &\vert& (x^0 - y^0) \lt 0 } \right. \end{aligned}

Similarly the anti-Feynman propagator is equivalently given by

\begin{aligned} \Delta_{\overline{F}}(x,y) & = \left\{ \array{ \frac{-}{(2\pi)^p} \int \frac{1}{\omega(\vec k)c} e^{+i\omega(\vec k)(x^0 - y^0)/c} e^{i \vec k \cdot (\vec x - \vec y)} \, d^p \vec k &\vert& (x^0 - y^0) \gt 0 \\ \frac{-}{(2\pi)^p} \int \frac{1}{2\omega(\vec k)c} e^{-i\omega(\vec k)(x^0 - y^0)/c} e^{i \vec k \cdot (\vec x - \vec y)} \, d^p \vec k &\vert& (x^0 - y^0) \lt 0 } \right. \\ & = \left\{ \array{ -\Delta_H(y,x) &\vert& (x^0 - y^0) \gt 0 \\ -\Delta_H(x,y) &\vert& (x^0 - y^0) \lt 0 } \right. \end{aligned}

Proof

By the mode expansion of $\Delta_{\pm}$ from (135) and the mode expansion of $H$ from (151) we have

\begin{aligned} \Delta_F(x,y) & = \left\{ \array{ \underset{ = \tfrac{i}{2} \Delta_+(x,y) + 0 \;\text{for}\; (x^0 - y^0) \gt 0 }{ \underbrace{ \frac{- i}{(2\pi)^{p}} \int \frac{1}{2 \omega(\vec k)/c} \sin\left( \omega(\vec k)(x^0 - y^0)/c \right) e^{i \vec k \cdot (\vec x - \vec y) } \, d^p \vec k } } + \underset{ = H(x,y) }{ \underbrace{ \frac{1}{(2\pi)^p} \int \frac{1}{2 \omega(\vec k)/c} \cos\left( \omega(\vec k)(x^0 - y^0)/c \right) e^{i \vec k \cdot (\vec x - \vec y) } \, d^p \vec k } } &\vert& (x^0 - y^0) \gt 0 \\ \underset{ = 0 + \tfrac{i}{2}\Delta_-(x,y) \;\text{for}\; (x^0 - y^0) \lt 0 }{ \underbrace{ \frac{+ i}{(2\pi)^{p}} \int \frac{1}{2 \omega(\vec k)/c} \sin\left( \omega(\vec k)(x^0 - y^0)/c \right) e^{i \vec k \cdot (\vec x - \vec y) } \, d^p \vec k } } + \underset{ = H(x,y) }{ \underbrace{ \frac{1}{(2\pi)^p} \int \frac{1}{2 \omega(\vec k)/c} \cos\left( \omega(\vec k)(x^0 - y^0)/c \right) e^{i \vec k \cdot (\vec x - \vec y) } \, d^p \vec k } } &\vert& (x^0 - y^0) \lt 0 } \right. \\ & = \left\{ \array{ \frac{1}{(2\pi)^p} \int \frac{1}{2\omega(\vec k)c} e^{-i\omega(\vec k)(x^0 - y^0)/c} e^{i \vec k \cdot (\vec x - \vec y)} \, d^p \vec k &\vert& (x^0 - y^0) \gt 0 \\ \frac{1}{(2\pi)^p} \int \frac{1}{2\omega(\vec k)c} e^{+i\omega(\vec k)(x^0 - y^0)/c} e^{i \vec k \cdot (\vec x - \vec y)} \, d^p \vec k &\vert& (x^0 - y^0) \lt 0 } \right. \\ & = \left\{ \array{ \Delta_H(x,y) &\vert& (x^0 - y^0) \gt 0 \\ \Delta_H(y,x) &\vert& (x^0 - y^0) \lt 0 } \right. \end{aligned}

where in the second line we used Euler's formula. The last line follows by comparison with (147) and using that the integral over $\vec k$ is invariant under $\vec k \mapsto - \vec k$ .

The computation for $\Delta_{\overline{F}}$ is the same, only now with a minus sign in front of the cosine:

\begin{aligned} \Delta_{\overline{F}}(x,y) & = \left\{ \array{ \underset{ = \tfrac{i}{2} \Delta_+(x,y) + 0 \;\text{for}\; (x^0 - y^0) \gt 0 }{ \underbrace{ \frac{- i}{(2\pi)^{p}} \int \frac{1}{2 \omega(\vec k)/c} \sin\left( \omega(\vec k)(x^0 - y^0)/c \right) e^{i \vec k \cdot (\vec x - \vec y) } \, d^p \vec k } } - \underset{ = H(x,y) }{ \underbrace{ \frac{1}{(2\pi)^p} \int \frac{1}{2 \omega(\vec k)/c} \cos\left( \omega(\vec k)(x^0 - y^0)/c \right) e^{i \vec k \cdot (\vec x - \vec y) } \, d^p \vec k } } &\vert& (x^0 - y^0) \gt 0 \\ \underset{ = 0 + \tfrac{i}{2}\Delta_-(x,y) \;\text{for}\; (x^0 - y^0) \lt 0 }{ \underbrace{ \frac{+ i}{(2\pi)^{p}} \int \frac{1}{2 \omega(\vec k)/c} \sin\left( \omega(\vec k)(x^0 - y^0)/c \right) e^{i \vec k \cdot (\vec x - \vec y) } \, d^p \vec k } } - \underset{ = H(x,y) }{ \underbrace{ \frac{1}{(2\pi)^p} \int \frac{1}{2 \omega(\vec k)/c} \cos\left( \omega(\vec k)(x^0 - y^0)/c \right) e^{i \vec k \cdot (\vec x - \vec y) } \, d^p \vec k } } &\vert& (x^0 - y^0) \lt 0 } \right. \\ & = \left\{ \array{ \frac{-1}{(2\pi)^p} \int \frac{1}{2\omega(\vec k)c} e^{+i\omega(\vec k)(x^0 - y^0)/c} e^{i \vec k \cdot (\vec x - \vec y)} \, d^p \vec k &\vert& (x^0 - y^0) \gt 0 \\ \frac{-1}{(2\pi)^p} \int \frac{1}{2\omega(\vec k)c} e^{-1i\omega(\vec k)(x^0 - y^0)/c} e^{i \vec k \cdot (\vec x - \vec y)} \, d^p \vec k &\vert& (x^0 - y^0) \lt 0 } \right. \\ & = \left\{ \array{ - \Delta_H(y,x) &\vert& (x^0 - y^0) \gt 0 \\ - \Delta_H(x,y) &\vert& (x^0 - y^0) \lt 0 } \right. \end{aligned}

As before for the causal propagator, there are equivalent reformulations of the Feynman propagator which are useful for computations:

Proposition

(Feynman propagator as a Cauchy principal value)

The Feynman propagator and anti-Feynman propagator (def. ) for the Klein-Gordon equation on Minkowski spacetime is equivalently given by the following expressions, respectively:

\begin{aligned} \left. \array{ \Delta_F(x,y) \\ \Delta_{\overline{F}}(x,y) } \right\} & = \underset{ {\epsilon \in (0,\infty)} \atop {\epsilon \to 0} }{\lim} \frac{+i}{(2\pi)^{p+1}} \int \int_{-\infty}^\infty \frac{ e^{i k_\mu (x^\mu - y^\mu)} }{ - k_\mu k^\mu - \left( \tfrac{m c}{\hbar} \right)^2 \pm i \epsilon } \, d k_0 \, d^p \vec k \end{aligned}

where we have a limit of distributions as for the Cauchy principal value (this prop).

Proof

We compute as follows:

\begin{aligned} \underset{ {\epsilon \in (0,\infty)} \atop {\epsilon \to 0} }{\lim} \frac{i}{(2\pi)^{p+1}} \int \int_{-\infty}^\infty \frac{ e^{i k_\mu (x^\mu - y^\mu)} }{ - k_\mu k^\mu - \left( \tfrac{m c}{\hbar} \right)^2 \pm i \epsilon } \, d k_0 \, d^p \vec k & = \underset{ {\epsilon \in (0,\infty)} \atop {\epsilon \to 0} }{\lim} \frac{i}{(2\pi)^{p+1}} \int \int_{-\infty}^\infty \frac{ e^{i k_\mu (x^\mu - y^\mu)} }{ (k_0)^2 - \underset{ \coloneqq \omega_{\pm\epsilon}(\vec k)^2/c^2 }{\underbrace{ \left( \omega(\vec k)^2/c^2 \pm i \epsilon \right) }} } \, d k_0 \, d^p \vec k \\ & = \underset{ {\epsilon \in (0,\infty)} \atop {\epsilon \to 0} }{\lim} \frac{i}{(2\pi)^{p+1}} \int \int_{-\infty}^\infty \frac{ e^{i k_\mu (x^\mu - y^\mu)} }{ \left( k_0 - \omega_{\pm \epsilon}(\vec k)/c \right) \left( k_0 + \omega_{\pm \epsilon}(\vec k)/c \right) } \, d k_0 \, d^p \vec k \\ & = \left\{ \array{ \frac{\mp 1}{(2\pi)^p} \int \frac{1}{2\omega(\vec k)c} e^{\pm i\omega(\vec k)(x^0 - y^0)/c} e^{i \vec k \cdot (\vec x - \vec y)} \, d^p \vec k &\vert& (x^0 - y^0) \gt 0 \\ \frac{\mp 1}{(2\pi)^p} \int \frac{1}{2\omega(\vec k)c} e^{\mp i\omega(\vec k)(x^0 - y^0)/c} e^{i \vec k \cdot (\vec x - \vec y)} \, d^p \vec k &\vert& (x^0 - y^0) \lt 0 } \right. \\ & = \left\{ \array{ \Delta_F(x,y) \\ \Delta_{\overline{F}}(x,y) } \right. \end{aligned}

Here

In the first step we introduced the complex square root $\omega_{\pm \epsilon}(\vec k)$ . For this to be compatible with the choice of non-negative square root for $\epsilon = 0$ in (134) we need to choose that complex square root whose complex phase is one half that of $\omega(\vec k)^2 - i \epsilon$ (instead of that plus π). This means that $\omega_{+ \epsilon}(\vec k)$ is in the upper half plane and $\omega_-(\vec k)$ is in the lower half plane.
In the third step we observe that
1. for $(x^0 - y^0) \gt 0$ the integrand decays for positive imaginary part and hence the integration over $k_0$ may be deformed to a contour which encircles the pole in the upper half plane;
2. for $(x^0 - y^0) \lt 0$ the integrand decays for negative imaginary part and hence the integration over $k_0$ may be deformed to a contour which encircles the pole in the lower half plane
and then apply Cauchy's integral formula which picks out $2\pi i$ times the residue a these poles.

Notice that when completing to a contour in the lower half plane we pick up a minus signs from the fact that now the contour runs clockwise.
In the fourth step we used prop. .

It follows that:

Proposition

(Feynman propagator is Green function)
The Feynman propagator $\Delta_F$ for the Klein-Gordon equation on Minkowski spacetime (def. ) is proportional to a Green function for the Klein-Gordon equation in that

\left( \Box_x - \left( \tfrac{m c}{\hbar}\right)^2 \right) \Delta_{F}(x,y) = (+i) \delta(x-y) \,.

Proof

Equation (?) in prop. says that the Feynman propagator is the inverse Fourier transform of distributions of

\widehat{\Delta_F}(k) \;=\; (+i) \underset{ {\epsilon \in (0,\infty)} \atop {\epsilon \to 0} }{\lim} \frac{ 1 }{ - k_\mu k^\mu - \left( \tfrac{m c}{\hbar} \right)^2 \pm i \epsilon }

This implies the statement as in the proof of prop. , via the analogue of equation (141).

$\,$

singular support and wave front sets

We now discuss the singular support (def. ) and the wave front sets (def. ) of the various propagators for the Klein-Gordon equation on Minkowski spacetime.

Proposition

(singular support of the causal propagator of the Klein-Gordon equation on Minkowski spacetime is the light cone)

The singular support of the causal propagator $\Delta_S$ for the Klein-Gordon equation on Minkowski spacetime, regarded via translation invariance as a generalized function in a single variable (136) is the light cone of the origin:

supp_{sing}(\Delta_S) \;=\; \left\{ x \in \mathbb{R}^{p,1} \,\vert\, {\vert x\vert}^2_\eta = 0 \right\} \,.

Proof

By prop. the causal propagator is equivalently the Fourier transform of distributions of the delta distribution of the mass shell times the sign function of the angular frequency; and by the basic properties of the Fourier transform (prop. ) this is the convolution of distributions of the separate Fourier transforms:

\begin{aligned} \Delta_S(x) & \propto \widehat{ \delta\left( \eta^{-1}(k,k) + \left( \tfrac{m c}{\hbar}\right)^2 \right) sgn( k_0 ) } \\ &\propto \widehat{\delta\left( \eta^{-1}(k,k) + \left( \tfrac{m c}{\hbar}\right)^2 \right)} \star \widehat{sgn( k_0 )} \end{aligned}

By prop. , the singular support of the first convolution factor is the light cone.

The second factor is

\begin{aligned} \widehat{sgn(k_0)} & \propto \left(2\widehat{\Theta(k_0)} - \widehat{1}\right) \delta(\vec k) \\ & \propto \left(2\tfrac{1}{i x^0 + 0^+} - \delta(x^0)\right) \delta(\vec k) \end{aligned}

(by example and example ) and hence the wave front set (def. ) of the second factor is

WF\left(\widehat{sgn(k_0)}\right) = \{(0,k) \;\vert\; k \in S(\mathbb{R}^{p+1})\}

(by example and example ).

With this the statement follows, via a partition of unity, from this prop..

For illustration we now make this general argument more explicit in the special case of spacetime dimension

p + 1 = 3 + 1

by computing an explicit form for the causal propagator in terms of the delta distribution, the Heaviside distribution and smooth Bessel functions.

We follow (Scharf 95 (2.3.18)).

Consider the formula for the causal propagator in terms of the mode expansion (145). Since the integrand here depends on the wave vector $\vec k$ only via its norm ${\vert \vec k\vert}$ and the angle $\theta$ it makes with the given spacetime vector via

\vec k \cdot (\vec x - \vec y) \;=\; {\vert \vec k\vert} \, {\vert \vec x\vert} \, \cos(\theta)

we may express the integration in terms of polar coordinates as follws:

\begin{aligned} \Delta_S(x - y) & = \frac{-1}{(2\pi)^p} \int \frac{1}{2 \omega(\vec k)/c} \sin\left( \omega(\vec k)(x^0 - y^0)/c \right) e^{i \vec k \cdot (\vec x - \vec y)} \, d^p \vec k \\ & = \frac{- vol_{S^{p-2}}}{(2\pi)^p} \underset{{\vert \vec k\vert} \in \mathbb{R}_{\geq 0}}{\int} \underset{ \theta \in [0,\pi] }{\int} \frac{ 1 }{ \omega(\vec k)/c } \sin\left( \omega(\vec k) (x^0 - y^0) /c \right) e^{ i {\vert \vec k\vert} {\vert \vec x - \vec y\vert} \cos(\theta) } {\vert \vec k\vert} ({\vert \vec k\vert} \sin(\theta))^{p-2} \, d \theta \wedge d {\vert \vec k\vert} \end{aligned}

In the special case of spacetime dimension $p + 1 = 3 + 1$ this becomes

(152)

\begin{aligned} \Delta_S(x - y) & = \frac{- 2\pi}{(2\pi)^{3}} \underset{{\vert \vec k\vert} \in \mathbb{R}_{\geq 0}}{\int} \frac{ {\vert \vec k \vert}^2 }{ \omega(\vec k)/c } \sin\left( \omega(\vec k) (x^0 - y^0) /c \right) \underset{ = \tfrac{1}{i {\vert \vec k\vert}\, {\vert \vec x - \vec y\vert} } \left( e^{i {\vert \vec k\vert}\, {\vert \vec x - \vec y\vert}} - e^{-i {\vert \vec k\vert}\, {\vert \vec x - \vec y\vert}} \right) }{ \underbrace{ \underset{ \cos(\theta) \in [-1,1] }{\int} e^{ i {\vert \vec k\vert} {\vert \vec x - \vec y\vert} \cos(\theta) } d \cos(\theta) } } \wedge d {\vert \vec k \vert} \\ & = \frac{- 2}{(2\pi)^{2} {\vert \vec x - \vec y\vert}} \underset{{\vert \vec k\vert} \in \mathbb{R}_{\geq 0}}{\int} \frac{ {\vert \vec k \vert} }{ \omega(\vec k)/c } \sin\left( \omega(\vec k) (x^0 - y^0) /c \right) \sin\left( {\vert \vec k\vert}\, {\vert \vec x - \vec y\vert} \right) \, d {\vert \vec k\vert} \\ & = \frac{- 2}{(2\pi)^{2} {\vert \vec x - \vec y\vert}} \frac{d}{d {\vert \vec x - \vec y \vert } } \underset{{\vert \vec k\vert} \in \mathbb{R}_{\geq 0}}{\int} \frac{ 1 }{ \omega(\vec k)/c } \sin\left( \omega(\vec k) (x^0 - y^0) /c \right) \cos\left( {\vert \vec k\vert}\, {\vert \vec x - \vec y\vert} \right) \, d {\vert \vec k\vert} \\ & = \frac{- 1}{(2\pi)^{2} {\vert \vec x - \vec y\vert}} \frac{d}{d {\vert \vec x - \vec y \vert } } \underset{ \kappa \in \mathbb{R} }{\int} \frac{ 1 }{ \omega(\kappa)/c } \sin\left( \omega(\kappa) (x^0 - y^0) /c \right) \cos\left( \kappa\, {\vert \vec x - \vec y\vert} \right) \, d \kappa \\ & = \frac{- 1}{2(2\pi)^{2} {\vert \vec x - \vec y\vert}} \frac{d}{d {\vert \vec x - \vec y} \vert } \left( \underset{\coloneqq I_+}{ \underbrace{ \underset{ \kappa \in \mathbb{R} }{\int} \frac{ 1 }{ \omega(\kappa)/c } \sin\left( \omega(\kappa) (x^0 - y^0) /c + \kappa\, {\vert \vec x - \vec y\vert} \right) d\kappa } } + \underset{ \coloneqq I_- }{ \underbrace{ \underset{ \kappa \in \mathbb{R} }{\int} \frac{ 1 }{ \omega(\kappa)/c } \sin\left( \omega(\kappa) (x^0 - y^0) /c - \kappa\, {\vert \vec x - \vec y\vert} \right) \, d \kappa } } \right) \,. \end{aligned}

Here in the second but last step we renamed $\kappa \coloneqq {\vert \vec k\vert}$ and doubled the integration domain for convenience, and in the last step we used the trigonometric identity $\sin(\alpha) \cos(\beta)\;=\; \tfrac{1}{2} \left( \sin(\alpha + \beta) + \sin(\alpha - \beta) \right)$ .

In order to further evaluate this, we parameterize the remaining components $(\omega/c, \kappa)$ of the wave vector by the dual rapidity $z$ , via

\left(\cosh(z)\right)^2 - \left( \sinh(z)\right)^2 = 1

\omega(\kappa)/c \;=\; \left( \tfrac{m c}{\hbar} \right) \cosh(z) \phantom{AA} \,, \phantom{AA} \kappa \;=\; \left( \tfrac{m c}{\hbar} \right) \sinh(z) \,,

which makes use of the fact that $\omega(\kappa)$ is non-negative, by construction. This change of integration variables makes the integrals under the braces above become

(153)

I_\pm \;=\; \int_{-\infty}^\infty \sin\left( \tfrac{m c}{\hbar} \left( (x^0 - y^0) \cosh(z) \pm {\vert \vec x - \vec y\vert} \sinh(z) \right) \right) \, d z \,.

Next we similarly parameterize the vector $x-y$ by its rapidity $\tau$ . That parameterization depends on whether $x-y$ is spacelike or not, and if not, whether it is future or past directed.

First, if $x-y$ is spacelike in that ${\vert x-y\vert}^2_\eta \gt 0$ then we may parameterize as

(x^0 - y^0) = \sqrt{{\vert x-y\vert}^2_\eta} \sinh(\tau) \phantom{AA} \,, \phantom{AA} {\vert \vec x - \vec y\vert} = \sqrt{ {\vert x-y\vert}^2_\eta} \cosh(\tau)

which yields

\begin{aligned} I_{\pm} & = \int_{-\infty}^\infty \sin\left( \tfrac{m c}{\hbar} \sqrt{ {\vert x-y\vert}^2_\eta } \left( \sinh(\tau) \cosh(z) \pm \cosh(\tau) \sinh(z) \right) \right) \, d z \\ & = \int_{-\infty}^\infty \sin\left( \tfrac{m c}{\hbar} \sqrt{ {\vert x-y\vert}^2_\eta} \left( \sinh\left( \tau \pm z\right) \right) \right) \, d z \\ & = \int_{-\infty}^\infty \sin\left( \tfrac{m c}{\hbar} \sqrt{ {\vert x-y\vert}^2_\eta } \left( \sinh\left( z \right) \right) \right) \, d z \\ & = 0 \,, \end{aligned}

where in the last line we observe that the integrand is a skew-symmetric function of $z$ .

Second, if $x-y$ is timelike with $(x^0 - y^0) \gt 0$ then we may parameterize as

(x^0 - y^0) = \sqrt{ -{\vert x-y\vert}^2_\eta} \cosh(\tau) \phantom{AA} \,, \phantom{AA} {\vert \vec x - \vec y\vert} = \sqrt{ -{\vert x - y\vert}^2_\eta } \sinh(\tau)

which yields

(154)

\begin{aligned} I_\pm & = \int_{-\infty}^\infty \sin\left( \tfrac{m c}{\hbar} \left( (x^0 - y^0) \cosh(z) \pm {\vert \vec x - \vec y\vert} \sinh(z) \right) \right) \, d z \\ & = \int_{-\infty}^\infty \sin\left( \sqrt{ - {\vert x-y\vert}^2_\eta } \tfrac{m c}{\hbar} \left( \cosh(\tau)\cosh(z) \pm \cosh(\tau) \sinh(z) \right) \right) \, d z \\ & = \int_{-\infty}^\infty \sin\left( \sqrt{ - {\vert x-y\vert}^2_\eta } \tfrac{m c}{\hbar} \left( \cosh(z \pm \tau) \right) \right) \, d z \\ & = \pi J_0\left( \sqrt{ - {\vert x-y\vert}^2_\eta} \tfrac{m c}{\hbar} \right) \end{aligned} \,.

Here in the last line we identified the integral representation of the Bessel function $J_0$ of order 0 (see here). The important point here is that this is a smooth function.

Similarly, if $x-y$ is timelike with $(x^0 - y^0) \lt 0$ then the same argument yields

I_\pm = - \pi J_0\left( \sqrt{ - {\vert x-y\vert}^2_\eta} \tfrac{m c}{\hbar} \right)

In conclusion, the general form of $I_\pm$ is

I_\pm = \pi sgn(x^0 - y^0) \Theta\left( -{\vert x-y\vert}^2_\eta \right) J_0\left( \sqrt{ - {\vert x-y\vert}^2_\eta } \tfrac{m c}{\hbar} \right) \,.

Therefore we end up with

(155)

\begin{aligned} \Delta_S(x,y) & = \frac{1}{4 \pi {\vert \vec x - \vec y\vert}} \frac{d}{d {\vert \vec x - \vec y\vert}} sgn(x^0) \Theta\left( -{\vert x-y\vert}^2_\eta \right) J_0\left( \sqrt{ -{\vert x-y\vert}^2_\eta } \tfrac{m c}{\hbar} \right) \\ & = \frac{-1}{2 \pi } \frac{d}{d (-{\vert x-y\vert}^2_\eta)} sgn(x^0) \Theta\left( -{\vert x-y\vert}^2_\eta \right) J_0\left( \sqrt{-{\vert x-y \vert}^2_\eta} \tfrac{m c}{\hbar} \right) \\ & = -\frac{1}{2 \pi } \frac{d}{d (- \vert x-y\vert^2_{\eta})} sgn(x^0) \Theta\left( - {\vert x - y\vert}^2_\eta \right) J_0\left( \tfrac{m c}{\hbar} \sqrt{ -{\vert x-y\vert}^2_\eta } \right) \\ & = \frac{-1}{2\pi} sgn(x^0) \left( \delta\left( -{\vert x-y\vert}^2_\eta \right) \;-\; \Theta\left( -{\vert x-y\vert}^2_\eta \right) \frac{d}{d \left({-\vert x-y\vert}^2_\eta\right) } J_0\left( \tfrac{m c}{\hbar} \sqrt{ -{\vert x-y\vert}^2_\eta } \right) \right) \end{aligned}

Proposition

(singular support of the Wightman propagator of the Klein-Gordon equation on Minkowski spacetime is the light cone)

The singular support of the Wightman propagator $\Delta_H$ (def. ) for the Klein-Gordon equation on Minkowski spacetime, regarded via translation invariance as a distribution in a single variable, is the light cone of the origin:

supp_{sing}(\Delta_H) = \left\{ x \in \mathbb{R}^{p,1} \;\vert\; {\vert x\vert}^2_\eta = 0 \right\} \,.

Proof

By prop. the causal propagator is equivalently the Fourier transform of distributions of the delta distribution of the mass shell times the sign function of the angular frequency; and by basic properties of the Fourier transform (prop. ) this is the convolution of distributions of the separate Fourier transforms:

\begin{aligned} \Delta_S(x) & \propto \widehat{ \delta\left( \eta^{-1}(k,k) + \left( \tfrac{m c}{\hbar}\right)^2 \right) sgn( k_0 ) } \\ &\propto \widehat{\delta\left( \eta^{-1}(k,k) + \left( \tfrac{m c}{\hbar}\right)^2 \right)} \star \widehat{sgn( k_0 )} \end{aligned}

By prop. , the singular support of the first convolution factor is the light cone.

The second factor is

\widehat{\Theta(k_0)} \propto \tfrac{1}{i x^0 + 0^+} \delta(\vec k)

(by example and example and hence the wave front set (def. ) of the second factor is

WF\left(\widehat{sgn(k_0)}\right) = \{(0,k) \;\vert\; k \in S(\mathbb{R}^{p+1})\}

(by example and example ).

With this the statement follows, via a partition of unity, from prop. .

For illustration, we now make this general statement fully explicit in the special case of spacetime dimension

p + 1 = 3 + 1

by computing an explicit form for the causal propagator in terms of the delta distribution, the Heaviside distribution and smooth Bessel functions.

We follow (Scharf 95 (2.3.36)).

By (151) we have

\begin{aligned} \Delta_H(x,y) & = \tfrac{i}{2} \underset{= \Delta_S(x,y)}{ \underbrace{ \frac{-1}{(2\pi)^p} \int \frac{1}{\omega(\vec k)/c} \sin\left( \omega(\vec k)(x^0 - y^0)/c \right) e^{i \vec k \cdot (\vec x - \vec y) } \, d^p \vec k }} \;+\; \underset{ \coloneqq H(x,y) }{ \underbrace{ \frac{1}{(2\pi)^p} \int \frac{1}{2 \omega(\vec k)/c} \cos\left( \omega(\vec k)(x^0 - y^0)/c \right) e^{i \vec k \cdot (\vec x - \vec y) } \, d^p \vec k }} \end{aligned}

The first summand, proportional to the causal propagator, which we computed as (155) in prop. to be

\tfrac{i}{2}\Delta_S(x,y) \;=\; \frac{-i}{4\pi} sgn(x^0) \left( \delta\left( -{\vert x-y\vert}^2_\eta \right) \;-\; \Theta\left( -{\vert x-y\vert}^2_\eta \right) \frac{d}{d \left({-\vert x-y\vert}^2_\eta\right) } J_0\left( \tfrac{m c}{\hbar} \sqrt{ -{\vert x-y\vert}^2_\eta } \right) \right) \,.

The second term is computed in a directly analogous fashion: The integrals $I_\pm$ from (153) are now

I_\pm \coloneqq \int_{-\infty}^\infty \cos\left( \tfrac{m c}{\hbar} \left( (x^0 - y^0) \cosh(z) \pm {\vert \vec x - \vec y\vert} \sinh(z) \right) \right) \, d z

Parameterizing by rapidity, as in the proof of prop. , one finds that for timelike $x-y$ this is

\begin{aligned} I_\pm & = \int_{-\infty}^\infty \cos\left( \tfrac{m c}{\hbar} \sqrt{ {\vert x-y\vert}^2_\eta } \left( \cosh\left( z \right) \right) \right) \, d z \\ & = - \pi N_0 \left( \tfrac{m c}{\hbar} \sqrt{ {\vert x-y\vert}^2_\eta } \right) \end{aligned}

while for spacelike $x-y$ it is

\begin{aligned} I_\pm & = \int_{-\infty}^\infty \cos\left( \tfrac{m c}{\hbar} \sqrt{ {\vert x-y\vert}^2_\eta } \left( \sinh\left( z \right) \right) \right) \, d z \\ & = 2 K_0 \left( \tfrac{m c}{\hbar} \sqrt{ {\vert x-y\vert}^2_\eta } \right) \,, \end{aligned}

where we identified the integral representations of the Neumann function $N_0$ (see here) and of the modified Bessel function $K_0$ (see here).

As for the Bessel function $J_0$ in (154) the key point is that these are smooth functions. Hence we conclude that

H(x,y) \;\propto\; \frac{d}{d \left( {\vert x-y\vert}^2_\eta \right)} \left( -\Theta\left( -{\vert x-y\vert}^2_\eta \right) N_0 \left( \tfrac{m c}{\hbar} \sqrt{ {\vert x-y\vert}^2_\eta } \right) + \Theta\left( {\vert x-y\vert}^2_\eta \right) \tfrac{2}{\pi} K_0 \left( \tfrac{m c}{\hbar} \sqrt{ {\vert x-y\vert}^2_\eta } \right) \right) \,.

This expression has singularities on the light cone due to the step functions. In fact the expression being differentiated is continuous at the light cone (Scharf 95 (2.3.34)), so that the singularity on the light cone is not a delta distribution singularity from the derivative of the step functions. Accordingly it does not cancel the singularity of $\tfrac{i}{2}\Delta_S(x,y)$ as above, and hence the singular support of $\Delta_H$ is still the whole light cone.

Proposition

(singular support of Feynman propagator for Klein-Gordon equation on Minkowski spacetime)

The singular support of the Feynman propagator $\Delta_H$ and of the anti-Feynman propagator $\Delta_{\overline{F}}$ (def. ) for the Klein-Gordon equation on Minkowski spacetime, regarded via translation invariance as a distribution in a single variable, is the light cone of the origin:

\left. \array{ supp_{sing}(\Delta_F) \\ supp_{sing}(\Delta_{\overline{F}}) } \right\} = \left\{ x \in \mathbb{R}^{p,1} \;\vert\; {\vert x\vert}^2_\eta = 0 \right\} \,.

(e.g DeWitt 03 (27.85))

Proof

By prop. the Feynman propagator is equivalently the Cauchy principal value of the inverse of the Fourier transformed Klein-Gordon operator:

\Delta_F \;\propto\; \widehat{ \frac{1}{-k_\mu k^\mu - \left(\tfrac{m c}{\hbar}\right)^2 + i 0^+} } \,.

With this, the statement follows immediately from prop. .

Proposition

(wave front sets of propagators of Klein-Gordon equation on Minkowski spacetime)

The wave front set of the various propagators for the Klein-Gordon equation on Minkowski spacetime, regarded, via translation invariance, as distributions in a single variable, are as follows:

the causal propagator $\Delta_S$ (prop. ) has wave front set all pairs $(x,k)$ with $x$ and $k$ both on the lightcone:

WF(\Delta_S) = \left\{ (x,k) \,\vert\, {\vert x\vert}^2_\eta = 0 \;\text{and} \; {\vert k\vert}^2_\eta = 0 \; \text{and} \, k \neq 0 \right\}

the Wightman propagator $\Delta_H$ (def. ) has wave front set all pairs $(x,k)$ with $x$ and $k$ both on the light cone and $k^0 \gt 0$ :

WF(\Delta_H) = \left\{ (x,k) \,\vert\, {\vert x\vert}^2_\eta = 0 \;\text{and} \; {\vert k\vert}^2_\eta = 0 \; \text{and} \; k^0 \gt 0 \right\}

the Feynman propagator $\Delta_S$ (def. ) has wave front set all pairs $(x,k)$ with $x$ and $k$ both on the light cone and $\pm k_0 \gt 0 \;\Leftrightarrow\; \pm x^0 \gt 0$

WF(\Delta_H) = \left\{ (x,k) \,\vert\, {\vert x\vert}^2_\eta = 0 \;\text{and} \; {\vert k\vert}^2_\eta = 0 \; \text{and} \; \left( \pm k_0 \gt 0 \;\Leftrightarrow\; \pm x^0 \gt 0 \right) \right\}

(Radzikowski 96, (16))

Proof

First regarding the causal propagator:

By prop. the singular support of $\Delta_S$ is the light cone.

Since the causal propagator is a solution to the homogeneous Klein-Gordon equation, the propagation of singularities theorem (prop. ) says that also all wave vectors in the wave front set are lightlike. Hence it just remains to show that all non-vanishing lightlike wave vectors based on the lightcone in spacetime indeed do appear in the wave front set.

To that end, let $b \in C^\infty_{cp}(\mathbb{R}^{p,1})$ be a bump function whose compact support includes the origin.

For $a \in \mathbb{R}^{p,1}$ a point on the light cone, we need to determine the decay property of the Fourier transform of $x \mapsto b(x-a)\Delta_S(x)$ . This is the convolution of distributions of $\hat b(k)e^{i k_\mu a^\mu}$ with $\widehat \Delta_S(k)$ . By prop. we have

\widehat \Delta_{S}(k) \;\propto\; \delta\left( -k_\mu k^\mu - \left( \tfrac{m c}{\hbar} \right)^2 \right) sgn(k_0) \,.

This means that the convolution product is the smearing of the mass shell by $\widehat b(k)e^{i k_\mu a^\mu}$ .

Since the mass shell asymptotes to the light cone, and since $e^{i k_\mu a^\mu} = 1$ for $k$ on the light cone (given that $a$ is on the light cone), this implies the claim.

Now for the Wightman propagator:

By def. its Fourier transform is of the form

\widehat \Delta_H(k) \;\propto\; \delta\left( k_\mu k^\mu + m^2 \right) \Theta( -k_0 )

Moreover, its singular support is also the light cone (prop. ).

Therefore now same argument as before says that the wave front set consists of wave vectors $k$ on the light cone, but now due to the step function factor $\Theta(-k_0)$ it must satisfy $0 \leq - k_0 = k^0$ .

Finally regarding the Feynman propagator:

by prop. the Feynman propagator coincides with the positive frequency Wightman propagator for $x^0 \gt 0$ and with the “negative frequency Hadamard operator” for $x^0 \lt 0$ . Therefore the form of $WF(\Delta_F)$ now follows directly with that of $WF(\Delta_H)$ above.

$\,$

propagators for the Dirac equation on Minkowski spacetime

We now discuss how the propagators for the free Dirac field on Minkowski spacetime (example ) follow directly from those for the scalar field discussed above.

Proposition

(advanced and retarded propagator for Dirac equation on Minkowski spacetime)

Consider the Dirac operator on Minkowski spacetime, which in Feynman slash notation reads

\begin{aligned} D & \coloneqq -i {\partial\!\!\!/\,} + \tfrac{m c}{\hbar} \\ & = -i \gamma^\mu \frac{\partial}{\partial x^\mu} + \tfrac{m c}{\hbar} \end{aligned} \,.

Its advanced and retarded propagators (def. ) are the derivatives of distributions of the advanced and retarded propagators $\Delta_\pm$ for the Klein-Gordon equation (prop. ) by ${\partial\!\!\!/\,} + m$ :

\Delta_{D, \pm} \;=\; \left( -i{\partial\!\!\!/\,} - \tfrac{m c}{\hbar} \right) \Delta_{\pm} \,.

Hence the same is true for the causal propagator:

\Delta_{D, S} \;=\; \left( -i{\partial\!\!\!/\,} - \tfrac{m c}{\hbar} \right) \Delta_{S} \,.

Proof

Applying a differential operator does not change the support of a smooth function, hence also not the support of a distribution. Therefore the uniqueness of the advanced and retarded propagators (prop. ) together with the translation-invariance and the anti-formally self-adjointness of the Dirac operator (as for the Klein-Gordon operator (136) implies that it is sufficent to check that applying the Dirac operator to the $\Delta_{D, \pm}$ yields the delta distribution. This follows since the Dirac operator squares to the Klein-Gordon operator:

\begin{aligned} \left( -i{\partial\!\!\!/\,} + \tfrac{m c}{\hbar} \right) \Delta_{D, \pm} & = \underset{ = \Box - \left(\tfrac{m c}{\hbar}\right)^2}{ \underbrace{ \left( -i{\partial\!\!\!/\,} + \tfrac{m c}{\hbar} \right) \left( -i{\partial\!\!\!/\,} - \tfrac{m c}{\hbar} \right) } } \Delta_{\pm} \\ & = \delta \end{aligned} \,.

Similarly we obtain the other propagators for the Dirac field from those of the real scalar field:

Definition

(Wightman propagator for Dirac operator on Minkowski spacetime)

The Wightman propagator for the Dirac operator on Minkowski spacetime is the positive frequency part of the causal propagator (prop. ), hence the derivative of distributions (def. ) of the Wightman propagator for the Klein-Gordon field (def. ) by the Dirac operator:

\begin{aligned} \left( -i{\partial\!\!\!/\,} + \tfrac{m c}{\hbar} \right)\Delta_{H}(x,y) & = \frac{1}{(2\pi)^p} \int \delta\left( k_\mu k^\mu + m^2 \right) \Theta( -k_0 ) ( {k\!\!\!/\,} + \tfrac{m c}{\hbar}) e^{i k_\mu (x^\mu-y^\mu) } \, d^{p+1} k \\ & = \frac{1}{(2\pi)^p} \int \frac{ \gamma^0 \omega(\vec k)/c + \vec \gamma \cdot \vec k + \tfrac{m c}{\hbar} }{2 \omega(\vec k)/c} e^{-i \omega(\vec k)(x^0 - y^0)/c + i \vec k \cdot (\vec x - \vec y) } \, d^p \vec k \,. \end{aligned}

Here we used the expression (?) for the Wightman propagator of the Klein-Gordon equation.

Definition

(Feynman propagator for Dirac operator on Minkowski spacetime)

The Feynman propagator for the Dirac operator on Minkowski spacetime is the linear combination

\Delta_{D, F} \;\coloneqq\; \Delta_{D,H} + i \Delta_{D, -}

of the Wightman propagator (def. ) and the retarded propagator (prop. ). By prop. this means that it is the derivative of distributions (def. ) of the Feynman propagator of the Klein-Gordon equation (def. ) by the Dirac operator

\begin{aligned} \Delta_{D, F} & = \left( -i{\partial\!\!\!/\,} + \tfrac{m c}{\hbar} \right)\Delta_{F}(x,y) & = \underset{ {\epsilon \in (0,\infty)} \atop {\epsilon \to 0} }{\lim} \frac{-i}{(2\pi)^{p+1}} \int \int_{-\infty}^\infty \frac{ \left( {k\!\!\!/\,} + \tfrac{m c}{\hbar} \right) e^{i k_\mu (x^\mu - y^\mu)} }{ - k_\mu k^\mu - \left( \tfrac{m c}{\hbar} \right)^2 + i \epsilon } \, d k_0 \, d^p \vec k \,. \end{aligned}

$\,$

This concludes our discussion of propagators induced from the covariant phase space of Green hyperbolic free Lagrangian field theory. These propagators will be the key in for quantization via causal perturbation theory. But not all free field theories have a covariant phase space of Green hyperbolic equations of motion, for instance the electromagnetic field, a priori, does not. Therefore before turning to quantization in the next chapter we first discuss how gauge symmetries obstruct the existence of Green hyperbolic equations of motion.

Gauge symmetries

In this chapter we discuss these topics:

compactly supported infinitesimal symmetries obstruct covariant phase space
Infinitesimal gauge symmetries
Lie algebra action and Lie algebroids
BRST complex
Examples of local BRST complexes

$\,$

An infinitesimal gauge symmetry of a Lagrangian field theory (def. below) is a infinitesimal symmetry of the Lagrangian which may be freely parameterized, hence “gauged”, by a gauge parameter. A Lagrangian field theory exhibiting these is also called a gauge theory.

By choosing the gauge parameter to have compact support, infinitesimal gauge symmetries in particular yield infinitesimal symmetries of the Lagrangian with compact spacetime support. One finds (prop. below) that the existence of on-shell non-trivial symmetries of this form is an obstruction to the existence of the covariant phase space of the theory (prop. ).

$\,$

gauge symmetries

name	meaning	def.
infinitesimal symmetry of the Lagrangian	evolutionary vector field which leaves invariant the Lagrangian density up to a total spacetime derivative	def.
spacetime-compactly supported infinitesimal symmetry of the Lagrangian	obstructs existence of the covariant phase space (if non-trivial on-shell)	prop.
infinitesimal gauge symmetry	gauge parameterized collection of infinitesimal symmetries of the Lagrangian; for compactly supported gauge parameter this yields spacetime-compactly supported infinitesimal symmetries	def.
rigid infinitesimal symmetry of the Lagrangian	infinitesimal symmetry modulo gauge symmetry	def.
generating set of gauge parameters	reflects all the Noether identities	remark
closed gauge parameters	Lie bracket of infinitesimal gauge symmetries closes on gauge parameters	def.

$\,$

But we may hard-wire these gauge equivalences into the very geometry of the types of fields by forming the homotopy quotient of the action of the infinitesimal gauge symmetries on the jet bundle. This homotopy quotient is modeled by the action Lie algebroid (def. below). Its algebra of functions is the local BRST complex of the theory (def. ) below.

In this construction the gauge parameters appear as auxiliary fields whose field bundle is a graded version of the gauge parameter-bundle. As such they are called ghost fields. The ghost fields may have infinitesimal gauge symmetries themselves which leads to ghost-of-ghost fields, etc. (example ) below.

It is these auxiliary ghost fields and ghost-of-ghost fields which will serve to remove the obstruction to the existence of the covariant phase space for gauge theories, this we arrive at in Gauge fixing, further below.

gauge parameters and ghost fields

symbol	meaning	def.
$\mathcal{G} \overset{gb}{\to} \Sigma$	gauge parameter bundle	def.
$c^\alpha \in C^\infty(\mathcal{G})$	coordinate function on gauge parameter bundle
$\epsilon \in \Gamma_\Sigma(\mathcal{G})$	gauge parameter
$\mathcal{G}[1]$	gauge parameter bundle regarded as graded manifold in degree 1	expl.
$C \in \Gamma_\Sigma(\mathcal{G}[1])$	gost field history
$\underset{deg = 1}{\underbrace{c^\alpha}} \in C^\infty(\mathcal{G}[1])$	ghost field component function
$\underset{deg = 1}{\underbrace{c^\alpha_{,\mu_1 \cdots \mu_k}}} \in C^\infty(J^\infty_\Sigma(\mathcal{G}[1]))$	ghost field jet component function
$\phantom{A}$
$\overset{(2)}{\mathcal{G}} \overset{gb}{\to} \Sigma$	gauge-of-gauge parameter bundle	expl.
$\overset{(2)}{c}^\alpha \in C^\infty(\overset{(2)}{\mathcal{G}})$	coordinate function on gauge-of-gauge parameter bundle
$\overset{(2)}{\epsilon} \in \Gamma_\Sigma(\mathcal{G})$	gauge-of-gauge parameter
$\overset{(2)}{\mathcal{G}}[2]$	gauge-of-gauge parameter bundle regarded as graded manifold in degree 1
$\overset{(2)}{C} \in \Gamma_\Sigma(\mathcal{G}[1])$	gost-of-ghost field history
$\underset{deg = 2}{\underbrace{\overset{(2)}{c}{}^\alpha}} \in C^\infty(\overset{(2)}{\mathcal{G}}[2])$	ghost-of-ghost field component function
$\underset{deg = 2}{\underbrace{{\overset{(2)}{c}}{}^\alpha_{,\mu_1 \cdots \mu_k}}} \in C^\infty(J^\infty_\Sigma(\overset{(2)}{\mathcal{G}}[2]))$	ghost-of-ghost field jet component function

The mathematical theory capturing these phenomena is the higher Lie theory of Lie-∞ algebroids (def. below).

$\,$

compactly supported infinitesimal symmetries obstruc the covariant phase space

As an immediate corollary of prop. we have the following important observation:

Proposition

(spacetime-compactly supported and on-shell non-trivial infinitesimal symmetries of the Lagrangian obstruct the covariant phase space)

Let $(E,\mathbf{L})$ be a Lagrangian field theory over a Lorentzian spacetime.

If there exists a single infinitesimal symmetry of the Lagrangian $v$ (def. ) such that

it has compact spacetime support (def. )
it does not vanish on-shell (52) (so not a trivial one, example )

then there does not exist any Cauchy surface (def. ) for the Euler-Lagrange equations of motion (def. ) outside the spacetime support of $v$ .

Proof

By prop. the flow along $\hat v$ preserves the on-shell space of field histories. Now by the assumption that $\hat v$ does not vanish on-shell implies that this flow is non-trivial, hence that it does continuously change the field histories over some points of spacetime, while the assumption that it has compact spacetime support means that these changes are confined to a compact subset of spacetime.

This means that there is a continuum of solutions to the equations of motion whose restriction to the infinitesimal neighbourhood of any codimension-1 suface $\Sigma_p \hookrightarrow \Sigma$ outside of this compact support coincides. Therefore this restriction map is not an isomorphism and $\Sigma_p$ not a Cauchy surface for the equations of motion.

Notice that there always exist spacetime-compactly supported infinitesimal symmetries that however do vanish on-shell:

Example

(trivial compactly-supported infinitesimal symmetries of the Lagrangian)

Let $(E,\mathbf{L})$ be a Lagrangian field theory (def. ) over Minkowski spacetime (def. ), so that the Lagrangian density is canonically of the form

\mathbf{L} = L \, dvol_\Sigma

with Lagrangian function $L \in \Omega^{0,0}_\Sigma(E) = C^\infty(J^\infty_\Sigma(E))$ a smooth function of the jet bundle (characterized by prop. ).

Then every evolutionary vector field (def. ) whose coefficients which is proportional to the Euler-Lagrange derivative (50) of the Lagrangian function $L$

v \; \coloneqq \; \frac{\delta_{EL} L }{\delta \phi^a} \kappa^{[a b]} \, \partial_{\phi^a} \;\in\; \Gamma_E^{ev}( T_\Sigma E )

by smooth coefficient functions $\kappa^{a b}$

\kappa^{[a b]} \;\in\; \Omega^{0,0}_\Sigma(E)

such that

each $\kappa^{a b}$ has compact spacetime support (def. )
$\kappa$ is skew-symmetric in its indices: $\kappa^{[a b]} = - \kappa^{[b a]}$

is an implicit infinitesimal gauge symmetry (def. ).

This is so for a “trivial reason” namely due to that that skew symmetry:

\begin{aligned} \mathcal{L}_{\hat v} \mathbf{L} & = \iota_{\hat v} \delta \mathbf{L} \\ &= \iota_{\hat v} ( \delta_{EL}\mathbf{L} - d \Theta_{BFV} ) \\ & = \iota_\epsilon \frac{\delta_{EL}L}{\delta \phi^a} \delta \phi^a + d \iota_{\hat v}\Theta_{BFV} \\ & = \underset{= 0}{ \underbrace{ \left( \frac{\delta_{EL} L }{\delta \phi^a} \right) \left( \frac{\delta_{EL} L }{\delta \phi^b} \right) \kappa^{[a b]} } } \, dvol_\Sigma \;+\; d \iota_{\hat v} \Theta_{BFV} \\ & = d \iota_{\hat v} \Theta_{BFV} \end{aligned}

Here the first steps are just recalling those in the proof of Noether's theorem I (prop. ) while the last step follows with the skew-symmetry of $\kappa$ .

Notice that this means that

the Noether current (79) vanishes: $J_{\hat v} = 0$ ;
the infinitesimal symmetry vanishes on-shell (41): $\hat v \vert_{\mathcal{E}} = 0$ .

Therefore these implicit infinitesimal gauge symmetries are called the trivial infinitesimal gauge transformations.

(e.g. Henneaux 90, section 2.5)

$\,$

Proposition implies that we need a good handle on determining whether the space of non-trivial compactly supported infinitesimal symmetries of the Lagrangian modulo trivial ones is non-zero. This obstruction turns out to be neatly captured by methods of homological algebra applied to the local BV-complex (def. ):

Example

(cochain cohomology of local BV-complex)

Let $(E,\mathbf{L})$ be a Lagrangian field theory (def. ) whose field bundle $E$ is a trivial vector bundle (example ) and whose Lagrangian density $\mathbf{L}$ is spacetime-independent (example ), and let $\Sigma \times \{\varphi\} \hookrightarrow \mathcal{E}$ be a constant section of the shell (59).

By inspection we find that the cochain cohomology of the local BV-complex $\Omega^{0,0}_{\Sigma,cp}(E,\varphi)\vert_{\mathcal{E}_{BV}}$ (def. ) has the following interpretation:

In degree 0 the image of the BV-differential coming from degree -1 and modulo $d$ -exact terms

im\left( \Gamma_{\Sigma,cp}(J^\infty_\Sigma T_\Sigma(E,\varphi)) \overset{s_{BV}}{\to} \Omega^{0,0}_\Sigma(E,\varphi)/im(d) \right)

is the ideal of functions modulo $im(d)$ that vanish on-shell. Since the differential going from degree 0 to degree 1 vanishes, the cochain cohomology in this degree is the quotient ring

\begin{aligned} H^0\left(\Omega^{0,0}_{\Sigma,cp}(E,\varphi)\vert_{\mathcal{E}_{BV}}\vert d\right) & \simeq \Omega^{0,0}_{\Sigma,cp}(E,\varphi)\vert_{\mathcal{E}}/im(d) \end{aligned}

of functions on the shell $\mathcal{E}$ (109).

In degree -1 the kernel of the BV-differential going to degree 0

ker\left( \Gamma_{\Sigma,cp}(J^\infty_\Sigma T_\Sigma(E,\varphi)) \overset{s_{BV}}{\to} \Omega^{0,0}_\Sigma(E,\varphi)\right)

is the space of implicit infinitesimal gauge symmetries (def. ) and the image of the differential coming from degree -2

im\left( \Gamma_{\Sigma,cp}(J^\infty_\Sigma T_\Sigma E,\varphi) \wedge_{\Omega^{0,0}_{\Sigma,cp}(E,\varphi)} \Gamma_{\Sigma,cp}(J^\infty_\Sigma T_\Sigma E,\varphi) \overset{s_{BV}}{\longrightarrow} \Gamma_{\Sigma,cp}(J^\infty_\Sigma T_\Sigma E,\varphi) \right)

is the trivial implicit infinitesimal gauge transformations (example ).

Therefore the cochain cohomology in degree -1 is the quotient space of implicit infinitesimal gauge transformations modulo the trivial ones:

(156)

H^{-1}\left( \Omega^{0,0}_\Sigma(E,\varphi)\vert_{\mathcal{E}_{BV}} \right) \simeq \frac{ \left\{ \text{implicit infinitesimal gauge transformations} \right\} } { \left\{ \text{ trivial implicit infinitesimal gauge transformations} \right\} }

Proposition

(local BV-complex is homological resolution of the shell iff there are no non-trivial compactly supported infinitesimal symmetries)

Let $(E,\mathbf{L})$ be a Lagrangian field theory (def. ) whose field bundle $E$ is a trivial vector bundle (example ) and whose Lagrangian density $\mathbf{L}$ is spacetime-independent (example ) and let $\Sigma \times \{\varphi\} \hookrightarrow \mathcal{E}$ be a constant section of the shell (59). Furthermore assume that $\mathbf{L}$ is at least quadratic in the vertical coordinates around $\varphi$ .

Then the local BV-complex $\Omega^{0,0}_\Sigma(E,\varphi)\vert_{\mathcal{E}_{BV}}$ of local observables (def. ) is a homological resolution of the algebra of functions on the infinitesimal neighbourhood of $\varphi$ in the shell (example ), hence the canonical comparison morphisms (113) is a quasi-isomorphism precisely if there is no non-trivial (example ) implicit infinitesimal gauge symmetry (def. ):

\left( \Omega^{0,0}_{\Sigma}(E,\varphi)\vert_{\mathcal{E}_{BV}} \overset{\simeq}{\longrightarrow} \Omega^{0,0}_{\Sigma}(E,\varphi)\vert_{\mathcal{E}} \right) \;\Leftrightarrow\; \left( \array{ \text{there are no non-trivial} \\ \text{compactly supported infinitesimal symmetries} } \right) \,.

Proof

By example the vanishing of compactly supported infinitesimal symmetries is equivalent to the vanishing of the cochain cohomology of the local BV-complex in degree -1 (156).

Therefore the statement to be proven is equivalently that the Koszul complex of the sequence of elements

\left( \frac{\delta_{EL} L}{\delta \phi^a} \in \Omega^{0,0}_{\Sigma,\varphi}(E) \right)_{a = 1}^s

is a homological resolution of $\Omega^{0,0}_\Sigma(E,\varphi)\vert_{\mathcal{E}}$ , hence has vanishing cohomology in all negative degrees, already if it has vanishing cohomology in degree -1.

By a standard fact about Koszul complexes (this prop.) a sufficient condition for this to be the case is that

the ring $\Omega^{0,0}_{\Sigma}(E,\varphi)$ is the tensor product of $C^\infty(\Sigma)$ with a Noetherian ring;
the elements $\frac{\delta_{EL} L }{\delta \phi^a}$ are contained in its Jacobson radical.

The first condition is the case since $\Omega^{0,0}_{\Sigma}(E,\varphi)$ is by definition a formal power series ring over a field tensored with $C^\infty(\Sigma)$ (by this example). Since the Jacobson radical of a power series algebra consists of those elements whose constant term vanishes (see this example), the assumption that $\mathbf{L}$ is at least quadratic, hence that $\delta_{EL}\mathbf{L}$ is at least linear in the fields, guarantees that all $\frac{\delta_{EL}L}{\delta \phi^a}$ are contained in the Jacobson radical.

Prop. says what gauge fixing has to accomplish: given a local BV-BRST complex we need to find a quasi-isomorphism to another complex which is such that it comes from a graded Lagrangian density whose BV-cohomology vanishes in degree -1 and hence induces a graded covariant phase space, and such that the remaining BRST differential respects the Poisson bracket on this graded covariant phase space.

$\,$

infinitesimal gauge symmetries

Prop. says that the problem is to identify the presence of spacetime-compactly supported infinitesimal symmetries that are on-shell non-trivial. One way they may be identified is if infinitesimal symmetries appear in linearly parameterized collections, where the parameter itself is an arbitrary spacetime-dependent section of some fiber bundle (hence is itself like a field history), because then choosing the parameter to have compact support yields an infinitesimal symmetry of the Lagrangian with compact spacetime support (remark below).

In this case we speak of a gauge parameter (def. below). It turns out that in most examples of Lagrangian field theories of interest, the compactly supported infinitesimal symmetries all come from gauge parameters this way. Therefore we now consider this case in detail.

Definition

(infinitesimal gauge symmetries)

Let $(E,\mathbf{L})$ be a Lagrangian field theory (def. ).

Then a collection of infinitesimal gauge symmetries of $(E,\mathbf{L})$ is

a vector bundle $\mathcal{G} \overset{gb}{\longrightarrow} \Sigma$ over spacetime $\Sigma$ of positive rank, to be called a gauge parameter bundle;
a bundle morphism (def. ) $R$ from the jet bundle of the fiber product $\mathcal{G} \times_\Sigma E$ with the field bundle (def. ) to the vertical tangent bundle of $E$ (def. ):

$\array{ J^\infty_\Sigma( \mathcal{G} \times_\Sigma E ) && \overset{R}{\longrightarrow} && T_\Sigma E & \overset{i}{\hookrightarrow} & T_\Sigma (\mathcal{G} \times_\Sigma E) \\ & \searrow && \swarrow \\ && E }$

such that

$R$ is linear in the first argument (in the gauge parameter);
$i \circ R$ is an evolutionary vector field on $\mathcal{G} \times_\Sigma E$ (def. );
$R$ is an infinitesimal symmetry of the Lagrangian (def. ) in the second argument.

We may express this equivalently in components in the case that the field bundle $E$ is a trivial vector bundle with field fiber coordinates $(\phi^a)$ (example ) and also $\mathcal{G}$ happens to be a trivial vector bundle

\mathcal{G} = \Sigma \times \mathfrak{g}

where $\mathfrak{g}$ is a vector space with coordinate functions $\{c^\alpha\}$ .

Then $R$ may be expanded in the form

(157)

R \;=\; \left( c^\alpha R^a_\alpha + c^\alpha_{,\mu} R^{a \mu}_\alpha + c^\alpha_{,\mu_1 \mu_2} R^{a \mu_1 \mu_2}_\alpha + \cdots \right) \partial_{\phi^a} \,,

where the components

R^{a \mu_1 \cdots \mu_k}_\alpha = R^{a \mu_1 \cdots \mu_k}_\alpha\left( (\phi^b), (\phi^b_{,\mu}), \cdots \right) \;\in\; \Omega^{0,0}_\Sigma(E) = C^\infty(J^\infty_\Sigma(E))

are smooth functions on the jet bundle of $E$ , locally of finite order (prop. ), and such that the Lie derivative of the Lagrangian density along $R(e)$ is a total spacetime derivative, which by Noether's theorem I (prop. ) means in components that

\left( c^\alpha R^a_\alpha + c^\alpha_{,\mu} R^{a \mu}_\alpha + c^\alpha_{,\mu_1 \mu_2} R^{a \mu_1 \mu_2}_\alpha + \cdots \right) \frac{\delta_{EL} \mathbf{L}}{\delta \phi^a} \;=\; \frac{d}{d x^\mu} J^\mu_{R} \,.

(e.g. Henneaux 90 (3))

The point is that infinitesimal gauge symmetries in particular yield spacetime-compactly supported infinitesimal gauge symmetries as in prop. :

Remark

(infinitesimal gauge symmetries yield spacetime-compactly supported infinitesimal symmetries of the Lagrangian)

Let $(E,\mathbf{L})$ be a Lagrangian field theory (def. ) and $\mathcal{G} \overset{gb}{\to} \Sigma$ a bundle of gauge parameters for it (def. ) with gauge parametrization

J^\infty_\Sigma(\mathcal{G} \times_\Sigma E) \overset{R}{\longrightarrow} T_\Sigma E \,.

Then for every smooth section $\epsilon \in \Gamma_\Sigma(\mathcal{G})$ of the gauge parameter bundle (def. ) there is an induced infinitesimal symmetry of the Lagrangian (def. ) given by the composition of $R$ with the jet prolongation of $\epsilon$ (def. )

R(\epsilon) \;\colon\; J^\infty_\Sigma(E) = \Sigma \times_\Sigma J^\infty_\Sigma(E) \overset{(j^\infty_\Sigma(\epsilon),id)}{\longrightarrow} J^\infty_\Sigma(\mathcal{G} \times_\Sigma E) \overset{R}{\longrightarrow} T_\Sigma E \,.

In terms of the components (157) this means that

R(\epsilon) \;=\; \left( \epsilon^\alpha R^a_\alpha + \frac{\partial^2 \epsilon^\alpha}{\partial x^\mu} R^{a \mu}_\alpha + \frac{\partial \epsilon^\alpha}{\partial x^\mu \partial x^\nu} R^{a \mu_1 \mu_2}_\alpha + \cdots \right) \,,

where now

\frac{\partial^k \epsilon^\alpha}{\partial x^{\mu_1} \cdots \partial x^{\mu_k}} \;=\; \frac{\partial^k \epsilon^\alpha}{\partial x^{\mu_1} \cdots \partial x^{\mu_k}}((x^\mu))

are the actual spacetime partial derivatives of the gauge parameter section (which are functions of spacetime).

In particular, since $\mathcal{G} \overset{gb}{\to} \Sigma$ is assumed to be a vector bundle, there always exists gauge parameter sections $\epsilon$ that have compact support (bump functions). For such compactly supported $\epsilon$ the infinitesimal symmetry $R(\epsilon)$ is spacetime-compactly supported as in prop. .

$\,$

The following remark and def. introduce some useful terminology:

Remark

(generating set of gauge transformations)

Given a Lagrangian field theory, then a choice of gauge parameter bundle $\mathcal{G} \overset{gb}{\to} \Sigma$ with gauge parameterized infinitesimal gauge symmetries $J^\infty_\Sigma(\mathcal{G} \times_\Sigma E) \overset{R}{\longrightarrow} T_\Sigma E$ (def. ) is indeed a choice and not uniquely fixed.

For example given any such bundle one may form the direct sum of vector bundles $\mathcal{G} \oplus_\Sigma \mathcal{G}'$ with any other smooth vector bundle $\mathcal{G}'$ over $\Sigma$ , extend $R$ by zero to $\mathbb{G}'$ , and thereby obtain another gauge parameterized of infinitesimal gauge symmetries

J^\infty_\Sigma((\mathcal{G}' \oplus_\Sigma \mathcal{G}) \times_\Sigma E) \overset{(0,R)}{\longrightarrow} T_\Sigma E \,.

Conversely, given any subbundle $\mathcal{G}' \hookrightarrow \mathcal{G}$ , then the restriction of $R$ to $\mathcal{G}'$ is still a gauge parameterized collection of infinitesimal gauge symmetries.

We will see that for the purpose of removing the obstruction to the existence of the covariant phase space, the gauge parameters have to capture all Noether identities (prop. ). In this case one says that the gauge parameter bundle $\mathcal{G} \overset{gb}{\to} \Sigma$ is a generating set.

(e.g. Henneaux 90, section (2.8))

Definition

(rigid infinitesimal symmetries of the Lagrangian)

Let $(E,\mathbf{L})$ be a Lagrangian field theory (def. ) and let $J^\infty_\Sigma(\mathcal{G} \times_\Sigma E) \overset{R}{\longrightarrow} T_\Sigma E$ be infinitesimal gauge symmetries (def. ) whose gauge parameters form a generating set (remark ).

Then the vector space of rigid infinitesimal symmetries of the Lagrangian is the quotient space of the infinitesimal symmetries of the Lagrangian by the image of the infinitesimal gauge symmetries:

\left\{ \text{rigid infinitesimal symmetries} \right\} \;=\; \left\{ \text{infinitesimal symmetries} \right\} \,/\, \left\{ \text{infinitesimal gauge symmetries} \right\} \,.

The following is a way to identify infinitesimal gauge symmetries:

Proposition

(Noether's theorem II – Noether identities)

Let $(E,\mathbf{L})$ be a Lagrangian field theory (def. ) and let $\mathcal{G} \overset{gb}{\to} \Sigma$ be a vector bundle.

Then a bundle morphism of the form

J^\infty_\Sigma(\mathcal{G} \times_\Sigma E) \overset{R}{\longrightarrow} T_\Sigma E

is a collection of infinitesimal gauge symmetries (def. ) with local components (157)

R \;=\; \left( c^\alpha R^a_\alpha + c^\alpha_{,\mu} R^{a \mu}_\alpha + c^\alpha_{,\mu_1 \mu_2} R^{a \mu_1 \mu_2}_\alpha + \cdots \right) \partial_{\phi^a}

precisely if the Euler-Lagrange form $\delta_{EL}\mathbf{L}$ (prop. ) satisfies the following conditions:

\left( R^{a}_\alpha \frac{\delta_{EL}\mathbf{L}}{\delta \phi^a} - \frac{d}{d x^\mu} \left( R^{a \mu}_\alpha \frac{\delta_{EL}\mathbf{L}}{\delta \phi^a} \right) + \frac{d^2}{d x^{\mu_1} d x^{\mu_2}} \left( R^{a \mu_1 \mu_2}_\alpha \frac{\delta_{EL}\mathbf{L}}{\delta \phi^a} \right) - \cdots \right) \;=\; 0 \,.

These relations are called the Noether identities of the Euler-Lagrange equations of motion (def ).

Proof

By Noether's theorem I, $R$ is an infinitesimal symmetry of the Lagrangian precisely if the contraction (def. ) of $R$ with the Euler-Lagrange form (prop. ) is horizontally exact:

\iota_{R} \delta_{EL}\mathbf{L} = d J_{\hat R} \,.

From (157) this means that

(158)

\begin{aligned} d J_{\hat R} & = \iota_{R} \delta_{EL} \mathbf{L} \\ & = \underset{k \in \mathbb{N}}{\sum} c^\alpha_{,\mu_1 \cdots \mu_k} R^{a \mu_1 \cdots \mu_k}_\alpha \frac{\delta_{EL} \mathbf{L}}{\delta \phi^a} \\ & = \underset{A}{ \underbrace{ c^\alpha \underset{k \in \mathbb{N}}{\sum} (-1)^k \frac{d^k}{d x^{\mu_1} \cdots d x^{\mu_k}} \left( R^{a \mu_1 \cdots \mu_k}_\alpha \frac{\delta_{EL} \mathbf{L}}{\delta \phi^a} \right) } } + d K \,, \end{aligned}

where in the last step we used jet-level integration by parts (example ) to move the total spacetime derivatives off of $c^\alpha$ , thereby picking up some horizontally exact correction term, as shown.

This means that the term $A$ over the brace is horizontally exact:

(159)

c^\alpha \underset{k \in \mathbb{N}}{\sum} (-1)^k \frac{d^k}{d x^{\mu_1} \cdots d x^{\mu_k}} \left( R^{a \mu_1 \cdots \mu_k}_\alpha \frac{\delta_{EL} \mathbf{L}}{\delta \phi^a} \right) \;=\; d(...)

But now the term on the left is independent of the jet coordinates $\epsilon^\alpha_{,\mu_1 \cdots \mu_k}$ of positive order $k \geq 1$ , while the horizontal derivative increases the dependency on the jet order by one. Therefore the term on the left is horizontally exact precisely if it vanishes, which is the case precisely if the coefficients of $c^\alpha$ vanish, which is the statement of the Noether identities.

Alternatively we may reach this conclusion from (159) by applying to both sides of (159) the Euler-Lagrange derivative (50) with respect to $c^\alpha$ . On the left this yields again the coefficients of $c^\alpha$ , while by the argument from example it makes the right hand side vanish.

As a corollary we obtain:

Proposition

(conserved charge of infinitesimal gauge symmetry vanishes)

The conserved current (def. )

J_R \;\in\; \Omega^{p,0}_\Sigma(E \times_\Sigma \mathcal{G})

which corresponds to an infinitesimal gauge symmetry $R$ (def. ) by Noether's first theorem (prop. ), is up to a term which vanishes on-shell (52)

K \;\in\; \Omega^p_\Sigma(E \times_\Sigma \mathcal{G}) \phantom{AA}\,, \phantom{AA} K\vert_{\mathcal{E}^\infty} = 0 \,,

not just on-shell-conserved, but off-shell-conserved, in that its total spacetime derivative vanishes identically:

d( J_R - K ) \;=\; 0 \,.

Moreover, if the field bundle as well as the gauge parameter-bundles are trivial vector bundles over Minkowski spacetime (example ) then $J_R$ is horizontally exact on-shell (52)

J_R \vert_{\mathcal{E}^\infty} = d(...) \,.

In particular the conserved charge (prop. )

Q_R \;\coloneqq\; \tau_{\Sigma_p}(J_R) \;\in\; C^\infty\left( \Gamma_{\Sigma_p}(E)_{\delta_{EL}\mathbf{L} = 0} \right)

corresponding to an infinitesimal gauge symmetry vanishes on every codimension one submanifold $\Sigma_p \hookrightarrow \Sigma$ of spacetime (without boundary, $\partial \Sigma_p = \emptyset$ ):

Q_R = 0 \,.

Proof

Take $K$ to be as in equation (158):

d J_R = A + d K \,.

By the construction there, $K$ manifestly vanishes on the prolonged shell $\mathcal{E}^\infty$ (52), being a sum of total spacetime derivatives of terms proportional to the components of the Euler-Lagrange form.

By Noether's second theorem (prop. ) we have $A = 0$ and hence

d(J_R - K) = 0 \,.

Now if the field bundle and gauge parameter bundle are trivial, then prop. implies that

(160)

J_R - K = d(...) \,.

By restricting this equation to the prolonged shell and using that $K\vert_{\mathcal{E}^\infty} = 0$ , it follows that $J_R \vert_{\mathcal{E}^\infty} = d(...)$ .

This implies $Q_R = 0$ by prop. and Stokes' theorem (prop. ).

This situation has a concise cohomological incarnation:

Example

(Noether's theorems I and II in terms of local BV-cohomology)

Let $(E,\mathbf{L})$ be a Lagrangian field theory (def. ) over Minkowski spacetime $\Sigma$ of dimension $p + 1$ , and let $\mathcal{G} \overset{gb}{\to} \Sigma$ be a gauge parameter bundle (def. ) which is closed (def. ). Assume that both are trivial vector bundles (example ) with field coordinates as in prop. .

Then in the local BV-complex (def. ) we have:

The $(s_{BV} + d)$ -closure of an element in total degree $p$ is characterizes as the direct sum of an evolutionary vector field which is an infinitesimal symmetry of the Lagrangian and theconserved current that corresponds to it under Noether's first theorem (prop. ).

Moreover, such a pair is $(s_{BV} + d)$ -exact precisely if the infinitesimal symmetry of the Lagrangian is in fact an infinitesimal gauge symmetry as witnessed by Noether's second theorem (prop. ).

(Barnich-Brandt-Henneaux 94, top of p. 20)

Proof

An element of the local BV-complex in degee $p$ is the direct sum of a horizontal differential form of degree $p$ with the product of a horizontal form of degree $(p+1)$ times a function proportional to the antifields:

\array{ \{J_v\} && \\ && \\ && \{ v^a \phi^\ddagger_a dvol_\Sigma\} }

Its closure means that

\array{ \{J_v\} &\overset{d}{\longrightarrow}& \{ \overset{= 0}{\overbrace{ d J_v - \iota_v \delta_{EL}\mathbf{L} }} \} \\ && \uparrow\mathrlap{s_{BV}} \\ && \{ v^a \phi^\ddagger_a dvol_\Sigma\} }

where the equality in the top right corner is euqation

It being exact means that

\array{ \left\{ ... \right\} & \overset{d}{\longrightarrow} & \left\{ J_R = K + d(...) \right\} &\overset{d}{\longrightarrow}& \left\{ d J_R \right\} \\ && \uparrow \\ && \left\{ K^{a \mu} \phi^\ddagger_a \iota_{\partial_\mu} dvol_\Sigma \right\} }

where now the equality in the second term from the left is equation (160) for conserved currents corresponding to infinitesimal gauge symmetries (prop. ).

We will need some further technical results on Noether identities:

Definition

(Noether operator)

A Noether operator $N$ is a differential operator (def. ) from the vertical cotangent bundle of $E$ (example ) to the trivial real line bundle

N(\omega) \;=\; \underset{k \in \mathbb{N}}{\sum} N^{a \mu_1 \cdots \mu_k} \frac{d^k}{d x^{\mu_1} \cdots d x^{\mu_k}} \omega_a

such that it annihilates the Euler-Lagrange form (prop. ):

\underset{k \in \mathbb{N}}{\sum} N^{a \mu_1 \cdots \mu_k} \frac{d^k}{d x^{\mu_1} \cdots d x^{\mu_k}} \frac{\delta_{EL} L}{\delta \phi^a} \;=\; 0 \,.

Given For $v$ an evolutionary vector field which is an infinitesimal symmetry of the Lagrangian (def. ), we define a new differentia opeator $v \cdot N$ by

(161)

(v \cdot N)^{a \mu_1 \cdots \mu_k} \;\coloneqq\; \widehat{v}\left( N^{a \mu_1 \cdots \mu_k} \right) \;-\; N^a \circ (\mathrm{D}_v)^\ast_a \,,

where $\widehat{v}$ denotes the prolongation of the evolutionary vector field $v$ (prop. ) and where $(\mathrm{D}_v)^\ast$ denotes the formally adjoint differential operator (def. ) of the evolutionary derivative of $v$ (def. ).

(Barnich 10 (3.1) and (3.5))

Proposition

(Lie algebra action of infinitesimal symmetries of the Lagrangian on Noether operators)

The operation (161) exhibits a Lie algebra action of the Lie algebra of infinitesimal symmetries of the Lagrangian (prop. ) on Noether operators (def. ), in that

$v \cdot N$ is again a Noether operator;
$v_1 \cdot (v_2 \cdot N) - v_2 \cdot (v_1 \cdot N) = [v_1, v_2] \cdot N$ .

Moreover, if $\rho$ denotes the map which identifies a Noether identity with an infinitesimal gauge symmetry by Noether's second theorem (def. ) then

(162)

\rho \left( v \cdot N \right) \;=\; \left[ v, \rho(N)\right] \,,

where on the right we have again the Lie bracket of evolutionary vector fields from (prop. ).

(Barnich 10, prop. 3.1 and (3.8))

Proof

For the first statement observe that by the product law for differentiation we have

\begin{aligned} 0 & = \widehat{v}\left( N(\delta_{EL} L) \right) \\ & = \widehat{v} \left( \underset{k \in \mathbb{N}}{\sum} N^{a \mu_1 \cdots \mu_k} \right) - \left( N^a \circ (\mathrm{D}_v)_a^b\left( \frac{\delta_{EL} L}{\delta \phi^a} \right) \right) \,, \end{aligned}

where on the right we used (82).

$\,$

Here are examples of infinitesimal gauge symmetries in Lagrangian field theory:

Example

(infinitesimal gauge symmetry of electromagnetic field)

Consider the Lagrangian field theory $(E,\mathbf{L})$ of free electromagnetism on Minkowski spacetime $\Sigma$ from example . With field coordinates denoted $((x^\mu), (a_\mu))$ the Lagrangian density is

\mathbf{L} \;=\; \tfrac{1}{2} f_{\mu \nu} f^{\mu \nu} \, dvol_\Sigma \,,

where $f_{\mu \nu} \coloneqq a_{\nu,\mu}$ is the universal Faraday tensor from example .

Let $\mathcal{G} \coloneqq \Sigma \times \mathbb{R}$ be the trivial line bundle, regarded as a gauge parameter bundle (def. ) with coordinate functions $((x^\mu), c)$ .

Then a gauge parametrized evolutionary vector field (157) is given by

R \;=\; c_{,\mu} \partial_{a_\mu}

with prolongation (prop. )

(163)

\widehat R \;=\; c_{,\mu} \partial_{a_\mu} + c_{,\mu \nu} \partial_{a_{\mu,\nu}} + \cdots \,.

This is because already the universal Faraday tensor is invariant under this flow:

\begin{aligned} \widehat {R} f_{\mu \nu} &= \tfrac{1}{2} c_{,\mu' \nu'} \partial_{a_{\mu',\nu'}} \left( a_{\nu, \mu} - a_{\mu,\nu} \right) \\ & = \tfrac{1}{2} \left( c_{,\nu\mu} - c_{,\mu \nu} \right) \\ & = 0 \,, \end{aligned}

because partial derivatives commute with each other: $c_{,\mu \nu} = c_{,\nu \mu}$ (29).

Equivalently, the Euler-Lagrange form

\delta_{EL}\mathbf{L} \;=\; \frac{d}{d x^\mu }f^{\mu \nu} \delta a_\nu \, dvol_\Sigma

of the theory (example ), corresponding to the vacuum Maxwell equations (example ), satisfies the following Noether identity (prop. ):

\frac{d}{d x^\mu} \frac{d}{d x^\nu} f^{\mu \nu} = 0 \,,

again due to the fact that partial derivatives commute with each other.

This is the archetypical infinitesimal gauge symmetry that gives gauge theory its name.

More generally:

Example

(infinitesimal gauge symmetry of Yang-Mills theory)

For $\mathfrak{g}$ a semisimple Lie algebra, consider the Lagrangian field theory of Yang-Mills theory on Minkowski spacetime from example , with Lagrangian density

\mathbf{L} \;=\; \tfrac{1}{2} f^\alpha_{\mu \nu} f_\alpha^{\mu \nu}

given by the universal field strength (31)

f^\alpha_{\mu \nu} \;\coloneqq\; \tfrac{1}{2} \left( a^\alpha_{[\nu,\mu]} + \tfrac{1}{2} \gamma^\alpha_{\beta \gamma} a^\beta_{[\mu} a^\gamma_{\nu]} \right) \,.

Let $\mathcal{G} \coloneqq \Sigma \times \mathfrak{g}$ be the trivial vector bundle with fiber $\mathfrak{g}$ , regarded as a gauge parameter bundle (def. ) with coordinate functions $((x^\mu), c^\alpha)$ .

Then a gauge parametrized evolutionary vector field (157) is given by

R \;=\; \left( c^\alpha_{,\mu} - \gamma^\alpha_{\beta \gamma} c^\beta a^\gamma_\mu \right) \partial_{a^\alpha_\mu}

with prolongation (prop. )

(164)

\widehat{R} \;=\; \left( c^\alpha_{,\mu} - \gamma^\alpha_{\beta \gamma} c^\beta a^\gamma_\mu \right) \partial_{a^\alpha_\mu} \;+\; \left( c^\alpha_{,\mu \nu} - \gamma^\alpha_{\beta \gamma} \left( c^\beta_{,\nu} a^\gamma_\mu + c^\beta a^\gamma_{\mu,\nu} \right) \right) \partial_{a^\alpha_{\mu,\nu}} \;+\; \cdots \,.

We compute the derivative of the Lagrangian function along this vector field:

\begin{aligned} \widehat{R} \left( \tfrac{1}{2} f^\alpha_{\mu \nu} f_\alpha^{\mu \nu} \right) & = \left( R f^\alpha_{\mu \nu} \right) f_\alpha^{\mu \nu} \\ & = \left( R \left( a_{\nu,\mu} + \tfrac{1}{2}\gamma^\alpha_{\beta \gamma} a^\beta_{\mu} a^\gamma_{\nu} \right) \right) f_\alpha^{\mu \nu} \\ & = \left( c^\alpha_{,\nu \mu} - \gamma^\alpha_{\beta \gamma} \left( c^\beta_{,\mu} a^\gamma_\nu + c^\beta a^\gamma_{\nu,\mu} \right) + \gamma^\alpha_{\beta \gamma} \left( c^\beta_{,\mu} - \gamma^\beta_{\beta' \gamma'} c^{\beta'} a^{\gamma'}_\mu \right) a^\gamma_{\nu} \right) f_\alpha^{\mu \nu} \\ & = - \gamma^{\alpha}_{\beta \gamma} c^\beta \underset{ = 2 f^\gamma_{\mu \nu} }{ \underbrace{ \left( a^\gamma_{\nu,\mu} + \gamma^\gamma_{\beta' \gamma'} a^{\beta'}_\mu a^{\gamma'}_\nu \right) } } f_\alpha{}^{\mu \nu} \\ &= 2 \gamma_{\alpha \beta \gamma} c^\alpha f^\beta_{\mu \nu} f^{\gamma \mu \nu} \\ & = 0 \,. \end{aligned}

Here in the third step we used that $c^\alpha_{,\nu \mu} = + c^\alpha_{,\mu \nu}$ (29), so that its contraction with the skew-symmetric $f_\alpha^{\mu\nu}$ vanishes, and in the last step we used that for a semisimple Lie algebra $\gamma_{\alpha \beta \gamma} \coloneqq k_{\alpha \alpha'} \gamma^{\alpha'}{}_{\beta \gamma}$ is totally skew symmetric.

So the Lagrangian density of Yang-Mills theory is strictly invariant under these infinitesimal gauge symmetries.

Example

(infinitesimal gauge symmetry of the B-field)

Consider the Lagrangian field theory of the B-field on Minkowski spacetime from example , with field bundle the differential 2-form-bundle $E = \wedge^2_\Sigma T^\ast \Sigma$ with coordinates $((x^\mu), (b_{\mu \nu}))$ subject to $b_{\mu \nu} = - b_{\nu \mu}$ ; and with Lagrangian density

\mathbf{L} \;=\; \tfrac{1}{2} h_{\mu_1 \mu_2 \mu_3} h^{\mu_1 \mu_2 \mu_3} \, dvol_\Sigma

for

h_{\mu_1 \mu_2 \mu_3} = b_{[\mu_1 \mu_2, \mu_3]}

the universal B-field strength (example ).

Let $\mathcal{G} \coloneqq T^\ast \Sigma$ be the cotangent bundle (def. ), regarded as a gauge parameter bundle (def. ) with coordinate functions $((x^\mu), (c_\mu))$ as in example .

Then a gauge parametrized evolutionary vector field (157) is given by

R \;=\; c_{\mu,\nu} \partial_{b_{\mu \nu}}

with prolongation (prop. )

(165)

\widehat R \;=\; c_{\mu,\nu} \partial_{b_{\mu \nu}} + c_{\mu,\nu \rho} \partial_{b_{\mu \nu, \rho}} + \cdots

In fact this leaves the Lagrangian function invariant, in direct higher analogy to example :

\begin{aligned} \widehat{R} \tfrac{1}{2} h_{\mu_1 \mu_2 \mu_3} h^{\mu_1 \mu_2 \mu_3} & = \left( \widehat{R} b_{\mu_1 \mu_2, \mu_3} \right) h^{\mu_1 \mu_2 \mu_3} \\ & = c_{\mu_1, \mu_2 \mu_3} h^{\mu_1 \mu_2 \mu_3} \\ & = 0 \end{aligned}

due to the symmetry of partial derivatives (29).

h_{,\mu}\partial_{c_{\mu}} + h_{,\mu \nu}\partial_{c_{\mu,\nu}}

R_\alpha^{a, \mu} = c_{\mu,\nu} R^{\mu, \nu}_{\mu' \nu'} \partial_{b_{\mu' \nu'}} \,.

While so far all this is in direct analogy to the case of the electromagnetic field (example ), just with field histories being differential 1-forms now replaced by differential 2-forms, a key difference is that now the gauge parameterization $R$ itself has infinitesimal gauge symmetries:

Let

(166)

\array{ \overset{(2)}{\mathcal{G}} &\coloneqq& \Sigma \times \mathbb{R} \\ {}^{\overset{(2)}{gb}}\downarrow && \downarrow^{\mathrlap{pr_1}} \\ \Sigma &=& \Sigma }

be the trivial real line bundle with coordinates $((x^\mu), \overset{(2)}{c})$ , to be regarded as a second order infinitesimal gauge-of-gauge symmetry, then

\overset{(2)}{R} \;\coloneqq\; \overset{(2)}{c}_{,\mu} \partial_{c_\mu}

with prolongation

(167)

\widehat{\overset{(2)}{R}} \;\coloneqq\; \overset{(2)}{c}_{,\mu} \partial_{c_\mu} + \overset{(2)}{c}_{,\mu \nu} \partial_{c_{\mu,\nu}} + \cdots

has the property that

(168)

\begin{aligned} \widehat{\overset{(2)}{R}} (R) &= \overset{(2)}{c}_{,\mu \nu} \frac{\partial}{\partial c_{\mu,\nu}} \left( c_{\mu',\nu'} \partial_{b_{\mu' \nu'}} \right) \\ & = \overset{(2)}{c}_{,\mu \nu} \partial_{b_{\mu \nu}} \\ & 0 \,. \end{aligned}

We further discuss these higher gauge transformations below.

$\,$

Lie algebra actions and Lie algebroids

We have seen above infinitesimal gauge symmetries implied by a Lagrangian field theory, exhibited by infinitesimal symmetries of the Lagrangian. In order to remove the obstructions that these infinitesimal gauge symmetries cause for the existence of the covariant phase space (via prop. and remark ) we will need (discussed below in Gauge fixing) to make these symmetries manifest by hard-wiring them into the geometry of the type of fields. Mathematically this means that we need to take the homotopy quotient of the jet bundle of the field bundle by the action of the infinitesimal gauge symmetries, which is modeled by their action Lie algebroid.

Here we introduce the required higher Lie theory of Lie ∞-algebroids (def. below). Further below we specify this to actions by infinitesimal gauge symmetries to obtain the local BRST complex of a Lagrangian field theory (def. ) below.

$\,$

The following discussion introduces and uses the tremendously useful fact that (higher) Lie theory may usefully be dually expressed in terms of differential graded-commutative algebra (def. below), namely in terms of “Chevalley-Eilenberg algebras”. In the physics literature, besides the BRST-BV formalism, this fact underlies the D'Auria-Fré formulation of supergravity (“FDAs”, see the convoluted history of the concept). Mathematically the deep underlying phenomenon is called the “Koszul duality between the Lie operad and the commutative algebra operad”, but this need not concern us here. The phenomenon is readily seen in direct application:

Before we proceed, we make explicit a structure wich we already encountered in example .

Definition

(differential graded-commutative superalgebra)

A differential graded-commutative superalgebra is

a cochain complex $A_\bullet$ of super vector spaces, hence for each $n \in \mathbb{Z}$

1 a super vector space $A_n = (A_n)_{even} \oplus (A_n)_{odd}$ ;
1. a super-degree preserving linear map
  
  $d \;\colon\; A_{n} \longrightarrow A_{n+1}$
such that

$d \circ d = 0$

1, an associative algebra-structure on $A \coloneqq \underset{n \in \mathbb{Z}}{\oplus} A_n$

such that for all $a_1, a_2 \in A$ with homogenous bidegree $a_i \in (A_{n_a})_{\sigma_a}$ we have the super sign rule

$a b = (-1)^{n_a n_b} (-1)^{\sigma_a \sigma_b} \, b a$
$d(a b) = (d a) b + (-1)^{n_1} a (d b)$ .

A homomorphism between two differential graded-commutative superalgebras is a linear map between the underlying super vector spaces which preserves both degrees, and respects the product as well as the differential $d$ .

We write $dgcSAlg$ for the category of differential graded-commutative superalgebra.

For the super sigsn rule appearing here see also e.g. Castellani-D’Auria-Fré 91 (II.2.106) and (II.2.109), Deligne-Freed 99, section 6.

Example

(de Rham algebra of super differential forms is differential graded-commutative superalgebra)

For $X$ a super Cartesian space, def. (or more generally a supermanifold, def. ) the de Rham algebra of super differential forms from def.

(\Omega^\bullet(X), d)

is a differential graded-commutative superalgebra (def. ) with product the wedge product of differential forms and differential the de Rham differential.

We will recognize the dual incarnation of this in higher Lie theory below in example .

Proposition

(Lie algebra in terms of Chevalley-Eilenberg algebra)

Let $\mathfrak{g}$ be a finite dimensional super vector space equipped with a super Lie bracket $[-,-] \colon \mathfrak{g} \otimes \mathfrak{g} \to \mathfrak{g}$ . Write $\mathfrak{g}^\ast$ for the dual vector space and $[-,-]^\ast \;\colon\; \mathfrak{g}^\ast \to \mathfrak{g}^\ast \wedge \mathfrak{g}^\ast$ for the linear dual map of the Lie bracket. Then on the Grassmann algebra $\wedge^\bullet \mathfrak{g}^\ast$ (which is $\mathbb{Z} \times \mathbb{Z}/2$ -bigraded as in def. ) the graded derivation $d_{CE}$ of degree $(1,even)$ , which on $\mathfrak{g}^\ast$ is given by $[-,-]^\ast$ constitutes a differential in that $(d_{CE})^2 = 0$ . The resulting differential graded-commutative algebra is called the Chevalley-Eilenberg algebra

CE(\mathfrak{g}) \;\coloneqq\; \left( \wedge^\bullet \mathfrak{g}^\ast \,, d_{CE} = [-,-]^\ast \right) \,.

In components:

If $\{c_\alpha\}$ is a linear basis of $\mathfrak{g}$ , so that the Lie bracket is given by the structure constants $(\gamma^\alpha{}_{\beta \gamma})$ as

[c_\beta, c_\gamma] = \tfrac{1}{2} \gamma^\alpha{}_{\beta \gamma} c_\gamma

and if $\{c^\alpha\}$ denotes the corresponding dual basis, then $\wedge^\bullet \mathfrak{g}^\ast$ is equivalently the differential graded-commutative superalgebra (def. ) generated from the $c^\alpha$ in bi-degree $(1,\sigma)$ , where $\sigma \in \mathbb{Z}/2$ is the super-degree of $c_\alpha$ as in def. subject to the relation

c^\alpha \wedge c^\beta = (-1) (-1)^{\sigma_\alpha \sigma_\beta} c^\beta \wedge c^\alpha

and the differential is given by

d_{CE} c^\alpha = \gamma^\alpha{}_{\beta \gamma} c^\beta \wedge c^\gamma \,.

Notice that by degree-reasons every degree +1 derivation on $\wedge^\bullet \mathfrak{g}^\ast$ is of this form,

\left\{ \array{ \text{derivations} \\ \text{of degree}\, (1,even) \\ \text{on} \, \wedge^\bullet \mathfrak{g}^\ast } \right\} \;\;\simeq\;\, \left\{ \array{ \text{super-skew} \\ \text{bilinear maps} \\ \mathfrak{g} \otimes \mathfrak{g} \overset{[-,-]}{\longrightarrow} \mathfrak{g} } \right\}

The condition that $(d_{CE})^2 = 0$ is equivalently the (super-)Jacobi identity on the bracket $[-,-]$ , making it an actual (super-)Lie bracket:

(169)

(d_{CE})^2 = 0 \phantom{AAA} \Leftrightarrow \phantom{AAA} \gamma^\alpha{}_{\beta [\gamma} \gamma^{\beta}{}_{\beta' \gamma']} = 0

(where the square brackets on the right denote super-skew-symmetrization).

Hence not only is $CE(\mathfrak{g})$ a differential graded-commutative superalgebra (def. ) whenever $\mathfrak{g}$ is a super Lie algebra, but conversely super Lie algebra-structure on a super vector space $\mathfrak{g}$ is the same as a differential of degree $(1,even)$ on the Grassmann algebra $\wedge^\bullet \mathfrak{g}^\ast$ .

We may state this equivalence in a more refined form: A homomorphism $\phi \;\colon\; \mathfrak{g} \longrightarrow \mathfrak{h}$ between super vector space is, by degree-reasons, the same as a graded algebra homomorphism $\phi^\ast \;\colon\; \wedge^\bullet \mathfrak{h}^\ast \longrightarrow \wedge^\bullet \mathfrak{g}^\ast$ and it is immediate to check that $\phi$ is a homomorphism of super Lie algebras precisely if $\phi^\ast$ is a homomorpism of differential algebras:

d_{CE(\mathfrak{g})} \circ \phi^\ast = \phi^\ast \circ d_{CE(\mathfrak{h})} \phantom{AAA} \Leftrightarrow \phantom{AAA} \phi^{\alpha_1}{}_{\beta_1} \gamma^{\beta_1}_{\mathfrak{g}}{}_{\beta_2 \beta_3} = \gamma^{\alpha_1}_{\mathfrak{h}}{}_{\alpha_2 \alpha_3} \phi^{\alpha_2}{}_{\beta_2} \phi^{\alpha_3}{}_{\beta_3} \,.

This is a natural bijection between homomrophism of super Lie algebras and of differential graded-commutative superalgebras (def. )

Hom_{SuperLieAlg}( \mathfrak{g}, \mathfrak{h} ) \;\simeq\; Hom_{dgcSAlg}\left( CE(\mathfrak{h}), CE(\mathfrak{g}) \right) \,.

Stated more abstractly this means that forming Chevalley-Eilenberg algebras is a fully faithful functor

CE \;\colon\; SuperLieAlg^{fin} \overset{\phantom{AAA}}{\hookrightarrow} dgcSAlg^{op} \,.

Notice that prop. establishes a dual algebraic incarnation of (super-)Lie algebras which is of analogous form as the dual algebraic characterization of (super-)Cartesian spaces from prop. and def. . In fact both these concepts unify into the concept of an action Lie algebroid:

Definition

(action of Lie algebra by infinitesimal diffeomorphism)

Let $X$ be a supermanifold (def. ), for instance a super Cartesian space (def. ), and let $\mathfrak{g}$ be a finite dimensional super Lie algebra as in prop. .

An action of $\mathfrak{g}$ on $X$ by infinitesimal diffeomorphisms, is a homomorphism of super Lie algebras

\rho \;\colon \mathfrak{g} \longrightarrow ( Vect(X), [-,-] )

to the tangent vector fields on $X$ (example )

Equivalently – to bring out the relation to the gauge parameterized infinitesimal gauge transformations in def. – this is a $\mathfrak{g}$ -parameterized section

\array{ \mathfrak{g} \times X && \overset{R}{\longrightarrow} && T X \\ & {\mathllap{pr_2}}\searrow && \swarrow_{\mathrlap{p}} \\ && X }

of the tangent bundle, such that for all pairs of points $e_1, e_2$ in $\mathfrak{g}$ we have

\left[R(e_1,-), R(e_2,-)\right] = R([e_1,e_2],-)

(with the Lie bracket of tangent vector fields on the left).

In components:

If $\{c^\alpha\}$ is a linear basis of $\mathfrak{g}^\ast$ with corresponding structure constants $(\gamma^\alpha{}_{\beta \gamma})$ (as in prop. ) and if $\{\phi^a\}$ is a coordinate chart of $X$ , then $R$ is given by

R \;=\; c^\alpha R_\alpha^a \frac{\partial}{\partial \phi^a} \,.

Now the construction of the Chevalley-Eilenberg algebra of a super Lie algebra (prop. ) extends to the case where this super Lie algebra acts on a supermanifold (def. ):

Definition

(action Lie algebroid)

Given a Lie algebra action

\mathfrak{g} \times X \overset{R}{\longrightarrow} T X

of a finite-dimensional super Lie algebra $\mathfrak{g}$ on a supermanifold $X$ (def. ) we obtain a differential graded-commutative superalgebra to be denoted $CE(X/\mathfrak{g})$

whose underlying graded-commutative superalgebra is the Grassmann algebra of the $C^\infty(X)$ -free module on $\mathfrak{g}^\ast$ over $C^\infty(X)$

$\wedge^\bullet_{C^\infty(X)} (\mathfrak{g}^\ast \otimes C^\infty(X)) \;=\; \underset{ deg = 0 }{ \underbrace{ C^\infty(X) }} \oplus \underset{ deg = 1 }{ \underbrace{ C^\infty(X) \otimes \mathfrak{g}^\ast }} \oplus \underset{ def = 2 }{ \underbrace{ C^\infty(X) \otimes \mathfrak{g}^\ast \wedge \mathfrak{g}^\ast }} \oplus \cdots$

which means that the graded manifold underlying the action Lie algebroid according to remark is

(170) $X/\mathfrak{g} \;=_{grmfd}\; \mathfrak{g}[1] \times X \,,$
whose differential $d_{CE}$ is given
1. on functions $f \in C^\infty(X)$ by the linear dual of the Lie algebra action
$d_{CE} f \coloneqq \rho(-)(f) \in C^\infty(X) \otimes \mathfrak{g}^\ast$
1. on dual Lie algebra elements $\omega \in \mathfrak{g}^\ast$ by the linear dual of the Lie bracket
  
  $d_{CE} \omega \coloneqq \omega([-,-]) \;\in \; \mathfrak{g}^\ast \wedge \mathfrak{g}^\ast \,.$

In components:

Assume that $X = \mathbb{R}^n$ is a super Cartesian space with coordinate functions $(\phi^a)$ and let $\{c_\alpha\}$ be a linear basis for $\mathfrak{g}$ with dual basis $(c^\alpha)$ for $\mathfrak{g}^\ast$ and structure constants $(\gamma^\alpha){}_{\beta \gamma}$ as in prop. and with the Lie action given in components $(R_\alpha^a)$ as in def. . Then the differential is given by

\begin{aligned} d_{CE(X/\mathfrak{g})} c^\alpha & = \tfrac{1}{2} \gamma^\alpha{}_{\beta \gamma} \, c^\beta \wedge c^\gamma \\ d_{CE(X/\mathfrak{g})} \phi^a & = R^a_\alpha c^\alpha \end{aligned}

We may summarize this by writing the derivation $d_{CE(X/\mathfrak{g})}$ as follows:

(171)

d_{CE(X/\mathfrak{g})} \;=\; c^\alpha R_\alpha^a \frac{\partial}{\partial \phi^a} \;+\; \tfrac{1}{2} \gamma^{\alpha}{}_{\beta \gamma} c^\beta c^\gamma \frac{\partial}{\partial c^\alpha} \,.

That this squares to zero is equivalently

in degree 0 the action property: $\rho([t, t']) = [\rho(t), \rho(t')]$
in degree 1 the Jacobi identity (169).

(d_{CE(X/\mathfrak{g})})^2 = 0 \phantom{AAA} \Leftrightarrow \phantom{AAA} \array{ \phantom{and} \, \text{Jacobi identity} \\ \text{and} \, \text{action property} }

Hence as before in prop. the Lie theoretic structure is faithfully captured dually by differential graded-commutative superalgebra.

We call the formal dual of this dgc-superalgebra the action Lie algebroid $X/\mathfrak{g}$ of $\mathfrak{g}$ acting on $X$ .

The concept emerging by this example we may consider generally:

Definition

(super-Lie ∞-algebroid)

Let $X$ be a supermanifold (def. ) (for instance a super Cartesian space, def. ) and write $C^\infty(X)$ for its algebra of functions. Then a connected super Lie ∞-algebroid $\mathfrak{a}$ over $X$ of finite type is a

a sequence $(\mathfrak{a}_k)_{ k = 1 }^\infty$ of free modules of finite rank over $C^\infty(X)$ , hence a graded module $\mathfrak{a}_\bullet$ in degrees $k \in \mathbb{N}$ ; $k \geq 1$
a differential $d_{CE}$ that makes the graded-commutative algebra $Sym_{C^\infty(X)}(\mathfrak{a}^\ast_\bullet)$ into a cochain differential graded-commutative algebra (hence with $d_{CE}$ of degree +1) over $\mathbb{R}$ (not necessarily over $C^\infty(X)$ ), to be called the Chevalley-Eilenberg algebra of $\mathfrak{a}$ :

(172) $CE(\mathfrak{a}) \;\coloneqq\; \left( Sym_{C^\infty(X)}(\mathfrak{a}^\ast_\bullet) \,,\, d_{CE} \right) \,.$

If we allow $\mathfrak{a}_\bullet$ to also have terms in non-positive degree, then we speak of a derived Lie algebroid. If $\mathfrak{a}_\bullet$ is only concentrated in negative degrees, we also speak of a derived manifold.

With $C^\infty(X)$ canonically itself regarded as a differential graded-commutative superalgebra, there is a canonical dg-algebra homomorphism

CE(\mathfrak{a}) \longrightarrow C^\infty(X)

which is the identity on $C^\infty(X)$ and zero on $\mathfrak{a}^\ast_{\neq 0}$ .

(We discuss homomorphism between Lie ∞-algebroid below in def. .)

Remark

(Lie algebroids as differential graded manifolds)

Definition of derived Lie algebroids is an encoding in higher algebra (homological algebra, in this case) of a situation that is usefully thought of in terms of higher differential geometry.

To see this, recall the magic algebraic properties of ordinary differential geometry (prop. )

Together these imply that we may think of the graded algebra underlying a Chevalley-Eilenberg algebra as being the algebra of functions on a graded manifold

\cdots \times \mathfrak{a}_2 \times \mathfrak{a}_1 \times X \times \mathfrak{a}_{-1} \times \cdots

which is infinitesimal in non-vanishing degree.

The “higher” in higher differential geometry refers to the degrees higher than zero. See at Higher Structures for exposition. Specifically if $\mathfrak{a}_\bullet$ has components in negative degrees, these are also called derived manifolds.

Example

(basic examples of Lie algebroids)

Two basic examples of Lie algebroids are:

For $X$ any supermanifold (def. ), for instance a super Cartesian space (def. ) then setting $\mathfrak{a}_{\neq 0 } \coloneqq 0$ and $d_{CE} \coloneqq 0$ makes it a Lie algebroid in the sense of def. .
For $\mathfrak{g}$ a finite-dimensional super Lie algebra, its Chevalley-Eilenberg algebra (prop. ) $CE(\mathfrak{g})$ exhibits $\mathfrak{g}$ as a Lie algebroid in the sense of def. . We write $B\mathfrak{g}$ or $\ast/\mathfrak{g}$ for $\mathfrak{g}$ regarded as a Lie algebroid this way.
For $X$ and $\mathfrak{g}$ as in the previous items, and for $R \colon \mathfrak{g} \times X \to T X$ a Lie algebra action (def. ) of $\mathfrak{g}$ on $X$ , then the dgs-superalegbra $CE(X/\mathfrak{g})$ from def. defines a Lie algebroid in the sense of def. , the action Lie algebroid.

In the special case that $\mathfrak{g} = 0$ this reduces to the first example, while for $X = \ast$ this reduces to the second example.

Here is another basic examples of Lie algebroids that will plays a role:

Example

(horizontal tangent Lie algebroid)

Let $\Sigma$ be a smooth manifold or more generally a supermanifold or more generally a locally pro-manifold (prop. ). Then we write $\Sigma/T\Sigma$ for the Lie algebroid over $X$ and whose Chevalley-Eilenberg algebra is generated over $C^\infty(X)$ in degree 1 from the module

\mathfrak{a}_1^\ast \coloneqq (\Gamma(T \Sigma))^\ast \simeq\Gamma(T^\ast \Sigma) = \Omega^1(\Sigma)

of differential 1-forms and whose Chevalley-Eilenberg differential is the de Rham differential, so that the Chevalley-Eilenberg algebra is the de Rham dg-algebra of super differential forms (example )

CE( \Sigma/T\Sigma ) \coloneqq (\Omega^\bullet(\Sigma), d_{dR}) \,.

This is called the tangent Lie algebroid of $\Sigma$ . As a graded manifold (via remark ) this is called the “shifted tangent bundle” $T[1] \Sigma$ of $X$ .

More generally, let $E \overset{fb}{\to} \Sigma$ be a fiber bundle over $\Sigma$ . Then there is a Lie algebroid $J^\infty_\Sigma(E)/T\Sigma$ over the jet bundle of $E$ (def. ) defined by its Chevalley-Eilenberg algebra being the horizontal part of the variational bicomplex (def. ):

CE\left( J^\infty_\Sigma(E)/T\Sigma \right) \;\coloneqq\; \left(\Omega^{\bullet,0}_\Sigma(E), d\right) \,.

The underlying graded manifold of $J^\infty_\Sigma(E)/T\Sigma$ is the fiber product $J^\infty_\Sigma(E)\times_\Sigma T[1]\Sigma$ of the jet bundle of $E$ with the shifted tangent bundle of $\Sigma$ .

There is then a canonical homomorphism of Lie algebroids (def. )

\array{ J^\infty_\Sigma(E)/T\Sigma \\ \downarrow \\ \Sigma/T\Sigma }

$\,$

local off-shell BRST complex

With the general concept of Lie algebra action (def. ) and the corresponding action Lie algebroids (def. ) and more general Lie ∞-algebroids in hand (def. ) we now apply this to the action of infinitesimal gauge symmetries (def. ) on field histories of a Lagrangian field theory, but we consider this locally, namely on the jet bundle. The Chevalley-Eilenberg algebra of the resulting action Lie algebroid (def. ) is known as the local BRST complex, example below.

The Lie algebroid-perspective on BV-BRST formalism has been made explicit in (Barnich 10).

Definition

(closed gauge parameters)

Let $(E,\mathbf{L})$ be a Lagrangian field theory (def. ). Then a gauge parameter bundle $\mathcal{G} \overset{gb}{\to} \Sigma$ parameterizing infinitesimal gauge symmetries (def. )

J^\infty_\Sigma(\mathcal{G} \times_\Sigma E) \overset{R}{\longrightarrow} T_\Sigma E

is called closed if it is closed under the Lie bracket of evolutionary vector fields (prop. ) in that there exists a morphism (not necessarily uniquely)

(173)

[-,-]_{\mathcal{G}} \;\colon\; J^\infty_\Sigma( \mathcal{G} \times_\Sigma \mathcal{G} \times_\Sigma E ) \longrightarrow J^\infty_\Sigma(\mathcal{G} \times_\Sigma E)

such that

\left[ R(-) , R(-)\right] \;=\; R([-,-]_{\mathcal{G}}) \,,

where on the left we have the Lie bracket of evolutionary vector fields from prop. .

Beware that $[-,-]_{\mathcal{G}}$ may be a function of the fields, namely of the jet bundle of the field bundle $E$ . Hence for closed gauge parameters $[-,-]_{\mathcal{G}}$ in general defines a Lie algebroid-structure (def. ).

Notice that the collection of all infinitesimal symmetries of the Lagrangian by necessity always forms a (very large) Lie algebra. The condition of closed gauge parameters is a condition on the choice of parameterization of the infinitesimal gauge symmetries, see remark .

(Henneaux 90, section 2.9)

Recall the general concept of a Lie algebra action from def. . The following realizes this for the action of closed infinitesimal gauge symmetries on the jet bundle of a Lagrangian field theory.

Example

(action of closed infinitesimal gauge symmetries on fields)

Let $(E,\mathbf{L})$ be a Lagrangian field theory (def. ), and let $\mathcal{G} \overset{gb}{\to} \Sigma$ be a bundle of gauge parameters (def. ) paramaterizing infinitesimal gauge symmetries

J^\infty_\Sigma(\mathcal{G} \times_\Sigma E) \overset{R}{\longrightarrow} T_\Sigma E

which are closed (def. ), via a bracket $[-,-]_{\mathcal{G}}$ .

By passing from these evolutionary vector fields $R$ (def. ) to their prolongations $\widehat{R}$ , being actual vector fields on the jet bundle (prop. ), we obtain a bundle morphism of the form

\array{ J^\infty_\Sigma(\mathcal{G}) \times_\Sigma J^\infty_\Sigma (E) && \overset{\widehat{R(e)}}{\longrightarrow} && T_\Sigma J^\infty_\Sigma(E) \\ & \searrow && \swarrow \\ && J^\infty_\Sigma(E) }

and via the assumed bracket $[-,-]_{\mathcal{G}}$ on gauge parameters this exhibits Lie algebroid structure on $J^\infty_\Sigma(\mathcal{G}) \times_\Sigma J^\infty_\Sigma(E) \overset{pr_2}{\to} J^\infty_\Sigma(E)$ .

In the case that $\mathcal{G} = \mathfrak{g} \times \Sigma$ is a trivial vector bundle, with fiber $\mathfrak{g}$ , then so is its jet bundle

J^\infty_\Sigma(\mathfrak{g} \times \Sigma) = \mathfrak{g}^\infty \times \Sigma \,.

If moreover the bracket (173) on the infinitesimal gauge symmetries is independent of the fields, then this induces a Lie algebra structure on $\mathfrak{g}^\infty$ and exhibits an Lie algebra action

\array{ \mathfrak{g}^\infty \times J^\infty_\Sigma E && \overset{\widehat{R(e)}}{\longrightarrow} && T_\Sigma J^\infty_\Sigma(E) \\ & \searrow && \swarrow \\ && J^\infty_\Sigma(E) } \,.

of the gauge parameterized infinitesimal gauge symmetries on the jet bundle of the field bundle by infinitesimal diffeomorphisms.

Example

(local BRST complex and ghost fields for closed infinitesimal gauge symmetries)

Let $(E,\mathbf{L})$ be a Lagrangian field theory (def. ), and let $\mathcal{G} \overset{gb}{\longrightarrow} \Sigma$ be a bundle of irreducible closed gauge parameters for the theory (def. ) with bundle morphism

\array{ J^\infty_\Sigma( \mathcal{G} \times_\Sigma E ) && \overset{R}{\longrightarrow} && T_\Sigma E \\ & \searrow && \swarrow \\ && E } \,.

Assuming that the gauge parameter bundle is trivial, $\mathcal{G} = \mathfrak{g} \times \Sigma$ , then by example this induces an action $\hat R$ of a Lie algebra $\mathfrak{g}^\infty$ on $J^\infty_\Sigma E$ by infinitesimal diffeomorphisms.

The corresponding action Lie algebroid $J^\infty_\Sigma(E)/\mathfrak{g}^\infty$ (def. ) has as underlying graded manifold (remark )

\mathfrak{g}^\infty[1] \times J^\infty_\Sigma(E) \;\simeq\; J^\infty_\Sigma( \mathcal{G}[1] \times_\Sigma E )

the jet bundle of the graded field bundle

E_{BRST} \;\coloneqq\; E \times_\Sigma \mathcal{G}[1]

which regards the gauge parameters as fields in degree 1. As such these are called ghost fields:

\left\{ \text{ghost field histories} \right\} \;\coloneqq\; \Gamma_\Sigma( \mathcal{G}[1] ) \,.

Therefore we write suggestively

E/\mathcal{G} \;\coloneqq\; J^\infty_\Sigma(E)/\mathfrak{g}^\infty

for the action Lie algebroid of the gauge parameterized implicit infinitesimal gauge symmetries on the jet bundle of the field bundle.

The Chevalley-Eilenberg differential of the BRST complex is traditionally denoted

s_{BRST} \coloneqq d_{CE} \,.

To express this in coordinates, assume that the field bundle $E$ as well as the gauge parameter bundle are trivial vector bundles (example ) with $(\phi^a)$ the field coordinates on the fiber of $E$ with induced jet coordinates $((x^\mu), (\phi^a), (\phi^a_{\mu}), \cdots)$ and $(c^\alpha)$ are ghost field coordinates on the fiber of $\mathcal{G}[1]$ with induced jet coordinates $((x^\mu), (c^\alpha), (c^\alpha_\mu), \cdots)$ .

Then in terms of the corresponding coordinate expression for the gauge symmetries $R$ (157) the BRST differential is given on the fields by

s_{BRST} \, \phi^a \;=\; c^\alpha_{,\mu_1 \cdots \mu_k} \underset{k \in \mathbb{N}}{\sum} R^{a \mu_1 \cdots \mu_k}_{\alpha}

and on the ghost fields by

s_{BRST} \, c^\alpha = \tfrac{1}{2}\gamma^\alpha{}_{\beta \gamma} c^\beta c^\gamma \,,

and it extends from there, via prop. , to jets of fields and ghost fields by (anti-)commutativity with the total spacetime derivative.

Moreover, since the action of the infinitesimal gauge symmetries is by definition via prolongations (prop. ) of evolutionary vector fields (def. ) and hence compatible with the total spacetime derivative (75) this construction descends to the horizontal tangent Lie algebroid $J^\infty_\Sigma(E)/T\Sigma$ (example ) to yield

E/(\mathcal{G}\times_\Sigma T \Sigma) \;\coloneqq\; \left(J^\infty_\Sigma(E)/T\Sigma\right)/\mathfrak{g}^\infty

The Chevalley-Eilenberg differential on $E/(\mathcal{G}\times_\Sigma T \Sigma)$ is

d - s_{BRST}

The Chevalley-Eilenberg algebra of functions on this differential graded manifold (172) is called the off-shell local BRST complex.

(Barnich-Brandt-Henneaux 94, Barnich 10 (35)).

Definition

(global BRST complex)

We may pass from the off-shell local BRST complex (def. ) on the jet bundle to the “global” BRST complex by transgression of variational differential forms (def. ):

Write $Obs(E \times_\Sigma \mathcal{G}[1])$ for the induced graded off-shell algebra of observables (def. ). For $A \in \Omega^{p+1,\bullet}_\Sigma(E \times_\Sigma \mathcal{G}[1])$ with corresponding local observable $\tau_\Sigma(A) \in LocObs_\Sigma(E \times_\Sigma \mathcal{G}[1])$ its BRST differential is defined by

s_{BRST} \tau_{\Sigma}(A) \;\coloneqq\; \tau_{\Sigma}(s_{BRST} A)

and extended from there to $Obs(E \times_\Sigma \mathcal{G}[1])$ as a graded derivation.

$\,$

Examples of local BRST complexes of Lagrangian gauge theories

Example

(local BRST complex for free electromagnetic field on Minkowski spacetime)

Consider the Lagrangian field theory of free electromagnetism on Minkowski spacetime (example ) with its gauge parameter bundle as in example .

By (163) the action of the BRST differential is the derivation

s_{BRST} \;=\; c_{,\mu} \frac{\partial}{\partial a_\mu} + c_{, \mu \nu} \frac{\partial}{\partial a_{\mu, \nu}} + \cdots \,.

In particular the Lagrangian density is BRST-closed

\begin{aligned} s_{BRST} \mathbf{L} & = s_{BRST} f_{\mu \nu} f^{\mu \nu} dvol_\Sigma \\ & = c_{,\mu \nu} f^{\mu \nu} dvol_\Sigma \\ & = 0 \end{aligned}

as is the Euler-Lagrange form (due to the symmetry $c_{,\mu \nu} = c_{,\nu \mu}$ (29) and in contrast to the skew-symmetry $f_{\mu \nu} = - f_{\nu \mu}$ ).

Example

(local BRST complex for the Yang-Mills field on Minkowski spacetime)

For $\mathfrak{g}$ a semisimple Lie algebra, consider the Lagrangian field theory of Yang-Mills theory on Minkowski spacetime from example , with Lagrangian density

\mathbf{L} \;=\; \tfrac{1}{2} f^\alpha_{\mu \nu} f_\alpha^{\mu \nu}

given by the universal field strength (31)

f^\alpha_{\mu \nu} \;\coloneqq\; \tfrac{1}{2} \left( a^\alpha_{[\nu,\mu]} + \tfrac{1}{2} \gamma^\alpha_{\beta \gamma} a^\beta_{[\mu} a^\gamma_{\nu]} \right) \,.

R \;=\; \left( c^\alpha_{,\mu} - \gamma^\alpha_{\beta \gamma} c^\beta a^\gamma_\mu \right) \partial_{a^\alpha_\mu}

from example .

We claim that these are closed gauge parameters in the sense of def. , hence that the local BRST complex in the form of example exists.

To see this, observe that, by def. the candidate BRST differential needs to be of the form (164) plus the linear dual of the Lie bracket $[-,-]_{\mathcal{G}}^\ast$

s_{BRST} \;=\; \left( \left( c^\alpha_{,\mu} - \gamma^\alpha_{\beta \gamma} c^\beta a^\gamma_\mu \right) \partial_{a^\alpha_\mu} \;+\; \text{prolongation} \right) + ([-,-]_{\mathcal{G}})^\ast \,.

Moreover, by def. we may equivalently make an Ansatz for $([-,-]_{\mathcal{G}})^\ast$ and if the resulting differential $s_{BRST}$ squares to zero, as this dually defines the required closure bracket $[-,-]_\mathcal{G}$ .

We claim that

(174)

s_{BRST} \;\coloneqq\; \widehat{ \left( c^\alpha_{,\mu} - \gamma^\alpha_{\beta \gamma} c^\beta a^\gamma_\mu \right) \frac{\partial}{\partial a^\alpha_\mu} } + \widehat{ \tfrac{1}{2}\gamma^\alpha{}_{\beta \gamma} \, c^\beta c^\gamma \frac{\partial}{\partial c^\alpha} } \,,

where the hat denotes prolongation (prop. ). This is the local (jet bundle) BRST differential for Yang-Mills theory on Minkowski spacetime.

(e.g. Barnich-Brandt-Henneaux 00 (7.2))

Proof

We need to show that (174) squares to zero. Consider the two terms that appear:

(s_{BRST})^2 = \left[ \widehat{ \left( c^\alpha_{,\mu} - \gamma^\alpha_{\beta \gamma} c^\beta a^\gamma_\mu \right) \partial_{a^\alpha_\mu} } \;,\; \widehat{ \left( c^{\alpha'}_{,\mu} - \gamma^{\alpha'}_{\beta \gamma} c^\beta a^\gamma_\mu \right) \partial_{a^{\alpha'}_\mu} } \right] \;+\; 2 \left[ \widehat{ \left( c^\alpha_{,\mu} - \gamma^\alpha_{\beta \gamma} c^\beta a^\gamma_\mu \right) \partial_{a^\alpha_\mu} } \;,\; \widehat{ \tfrac{1}{2}\gamma^\alpha{}_{\beta \gamma} \, c^\beta c^\gamma \frac{\partial}{\partial c^\alpha} } \right] \,.

The first term is

\begin{aligned} \left[ \widehat{ \left( c^\alpha_{,\mu} - \gamma^\alpha_{\beta \gamma} c^\beta a^\gamma_\mu \right) \partial_{a^\alpha_\mu} } \;,\; \widehat{ \left( c^{\alpha'}_{,\mu} - \gamma^{\alpha'}_{\beta \gamma} c^\beta a^\gamma_\mu \right) \partial_{a^{\alpha'}_\mu} } \right] & = - 2 \gamma^{\alpha'}_{\beta \gamma} \widehat{ c^\beta \left( c^\gamma_{,\mu} - \gamma^\gamma_{\beta' \gamma'} c^{\beta'} a^{\gamma'}_\mu \right) \frac{\partial}{\partial a^{\alpha'}_\mu} } \\ & = - 2 \gamma^{\alpha'}_{\beta \gamma} \widehat{ c^\beta c^\gamma_{,\mu} \frac{\partial}{\partial a^{\alpha'}_\mu} } + 2 \gamma^{\alpha'}_{\beta \gamma} \gamma^\gamma_{\beta' \gamma'} \widehat{ c^\beta c^{\beta'} a^{\gamma'}_\mu \frac{\partial}{\partial a^{\alpha'}_\mu} } \\ & = - 2 \gamma^{\alpha'}_{\beta \gamma} \widehat{ c^\beta c^\gamma_{,\mu} \frac{\partial}{\partial a^{\alpha'}_\mu} } + \gamma^{\alpha'}_{\beta \gamma} \gamma^\gamma_{\beta' \gamma'} \widehat{ \left( c^\beta c^{\beta'} a^{\gamma'}_\mu - c^{\beta'} c^{\beta} a^{\gamma'}_\mu \right) \frac{\partial}{\partial a^{\alpha'}_\mu} } \\ & = - 2 \gamma^{\alpha'}_{\beta \gamma} \widehat{ c^\beta c^\gamma_{,\mu} \frac{\partial}{\partial a^{\alpha'}_\mu} } + \gamma^{\alpha'}_{\beta \gamma} \gamma^\gamma_{\beta' \gamma'} \widehat{ \left( - c^\beta c^{\gamma'} a^{\beta'}_\mu - c^{\beta'} c^{\beta} a^{\gamma'}_\mu \right) \frac{\partial}{\partial a^{\alpha'}_\mu} } \\ & = - 2 \gamma^{\alpha'}_{\beta \gamma} \widehat{ c^\beta c^\gamma_{,\mu} \frac{\partial}{\partial a^{\alpha'}_\mu} } + \gamma^{\alpha'}_{\beta \gamma} \gamma^\gamma_{\beta' \gamma'} \widehat{ c^{\gamma'} c^{\beta'} a^{\beta}_\mu \frac{\partial}{\partial a^{\alpha'}_\mu} } \\ & = - 2 \gamma^{\alpha'}_{\beta \gamma} \widehat{ c^\beta c^\gamma_{,\mu} \frac{\partial}{\partial a^{\alpha'}_\mu} } + \gamma^{\alpha'}_{\gamma \beta} \gamma^\beta_{\beta' \gamma'} \widehat{ c^{\beta'} c^{\gamma'} a^{\gamma}_\mu \frac{\partial}{\partial a^{\alpha'}_\mu} } \end{aligned}

Here first we expanded out, then in the second-but-last line we used the Jacobi identity (169) and in the last line we adjusted indices, just for convenience of comparison with the next term. That next term is

\left[ \widehat{ \left( c^\alpha_{,\mu} - \gamma^\alpha_{\beta \gamma} c^\beta a^\gamma_\mu \right) \partial_{a^\alpha_\mu} } \;,\; \gamma^\alpha{}_{\beta \gamma} \, \widehat{c^\beta c^\gamma \frac{\partial}{\partial c^\alpha}} \right] = 2 \gamma^\alpha_{\beta \gamma} \widehat{ c^\beta_{,\mu} c^\gamma \frac{\partial}{\partial a^\alpha_\mu} } - \gamma^\alpha_{\beta \gamma} \gamma^\beta_{\beta' \gamma'} \widehat{ c^{\beta'} c^{\gamma'} a^\gamma_\mu \frac{\partial}{\partial a^\alpha_\mu} } \,,

where the first summand on the right comes from the prolongation.

This shows that the two terms cancel.

Example

(local BRST complex for B-field on Minkowski spacetime)

By example the local BRST complex (example ) has BRST differential of the form

c_{\mu, \nu} \frac{\partial}{\partial b_{\mu \nu}} + c_{\mu,\nu_1 \nu_2} \frac{\partial}{\partial b_{\mu \nu_1, \nu_2}} + \cdots \,.

In this case this enhanced to an Lie 2-algebroid by regarding the second-order gauge parameters (166) in degree 2 to form a graded field bundle

\underset{ \{\overset{(2)}{c}\} }{ \underbrace{ \overset{(2)}{\mathcal{G}}[2] }} \times_\Sigma \underset{\{c_\mu\}}{ \underbrace{ \mathcal{G}[1] } } \times_\Sigma \underset{ (b_{\mu \nu}) }{ \underbrace{ E }} \;=\; \mathbb{R}[2] \times T^\ast \Sigma [1] \times_\Sigma E

by adding the ghost-of-ghost field $(\overset{(2)}{c})$ (167) and taking the local BRST differential to be the sum of the first order infinitesimal gauge symmetries (165) and the second order infinitesimal gauge-of-gauge symmetry (167):

s_{BRST} \;=\; \left( c_{\mu, \nu} \frac{\partial}{\partial b_{\mu \nu}} + c_{\mu,\nu_1 \nu_2} \frac{\partial}{\partial b_{\mu \nu_1, \nu_2}} + \cdots \right) + \left( \overset{(2)}{c}_{,\mu} \frac{\partial}{\partial c_\mu} + \overset{(2)}{c}_{,\mu \nu} \frac{\partial}{\partial c_{\mu,\nu}} + \cdots \right) \,.

Notice that this indeed still squares to zero, due to the second-order Noether identity (168):

\begin{aligned} \left( s_{BSRT} \right)^2 & = \left[ \overset{(2)}{c}_{,\mu \nu} \frac{\partial}{\partial c_{\mu,\nu}}, c_{\mu, \nu} \frac{\partial}{\partial b_{\mu \nu}} \right] \;+\; \left[ \overset{(2)}{c}_{,\mu \nu_1 \nu_2} \frac{\partial}{\partial c_{\mu,\nu_1 \nu_2}}, c_{\mu, \nu_1 \nu_2} \frac{\partial}{\partial b_{\mu \nu_1, \nu_2}} \right] \\ & = \underset{ = 0 }{ \underbrace{ \overset{(2)}{c}_{,\mu \nu} \frac{\partial}{\partial b_{\mu \nu}} }} \;+\; \underset{ = 0 }{ \underbrace{ \overset{(2)}{c}_{,\mu \nu_1 \nu_2} \frac{\partial}{\partial b_{\mu \nu_1, \nu_2}} }} \;+\; \cdots \\ & = 0 \,. \end{aligned}

$\,$

This concludes our discussion of infinitesimal gauge symmetries, their off-shell action on the jet bundle of the field bundle and the corresponding homotopy quotient exhibited by the local BRST complex. In the next chapter we discuss the homotopy intersection of this construction with the shell: the reduced phase space.

Reduced phase space

In this chapter we discuss these topics:

Global gauge reduction for strictly invariant functions (action functionals):
- Derived critical loci inside Lie algebroids
- Schouten bracket on Lie algebroids
Local gauge reduction for weakly invariant local functions (Lagrangian densities):

For a Lagrangian field theory with infinitesimal gauge symmetries, the reduced phase space is the quotient of the shell (the solution-locus of the equations of motion) by the action of the gauge symmetries; or rather it is the combined homotopy quotient by the gauge symmetries and its homotopy intersection with the shell. Passing to the reduced phase space may lift the obstruction for a gauge theory to have a covariant phase space and hence a quantization.

The higher differential geometry of homotopy quotients and homotopy intersections is usefully modeled by tools from homological algebra, here known as the BV-BRST complex.

In order to exhibit the key structure without getting distracted by the local jet bundle geometry, we first discuss the simple form in which the reduced phase space would appear after transgression (def. ) if spacetime were compact, so that, by the principle of extremal action (prop. ), it would be the derived critical locus ( $d S \simeq 0$ ) of a globally defined action functional $S$ . This “global” version of the BV-BRST complex is example below.

The genuine local construction of the derived shell is in the jet bundle of the field bundle, where the action functional appears “de-transgressed” in the form of the Lagrangian density, which however is invariant under gauge transformations generally only up to horizontally exact terms. This local incarnation of the redcuced phase space is modeled by the genuine local BV-BRST complex, example below.

Finally, under transgression of variational differential forms this yields a differential on the graded local observables of the field theory. This is the global BV-BRST complex of the Lagrangian field theory (def. below).

$\,$

derived critical loci inside Lie algebroids

By analogy with the algebraic formulation of smooth functions between Cartesian spaces (the embedding of Cartesian spaces into formal duals of R-algebras, prop. ) it is clear how to define a map (homomorphism) between Lie algebroids:

Definition

(homomorphism between Lie algebroids)

Given two derived Lie algebroids $\mathfrak{a}$ , $\mathfrak{a}'$ (def. ), then a homomorphism between them

f \;\colon\; \mathfrak{a} \longrightarrow \mathfrak{a}'

is a dg-algebra-homomorphism between their Chevalley-Eilenberg algebras going the other way around

CE(\mathfrak{a}) \longleftarrow CE(\mathfrak{a}') \;\colon\; f^\ast

such that this covers an algebra homomorphism on the function algebras:

\array{ CE(\mathfrak{a}) &\overset{f^\ast}{\longleftarrow}& CE(\mathfrak{a}') \\ \downarrow && \downarrow \\ C^\infty(X) &\underset{(f\vert_X)^\ast}{\longleftarrow}& C^\infty(Y) } \,.

(This is also called a “non-curved sh-map”.)

Example

(invariant functions in terms of Lie algebroids)

Let $\mathfrak{g}$ be a super Lie algebra equipped with a Lie algebra action (def. )

\array{ \mathfrak{g} \times X && \overset{R}{\longrightarrow} && T X \\ & {}_{\mathllap{pr_2}}\searrow && \swarrow_{\mathrlap{rb}} \\ && X }

on a supermanifold $X$ . Then there is a canonical homomorphism of Lie algebroids (def. )

(175)

\array{ X &&& CE(X) &=& C^\infty(X) &\oplus& 0 \\ \downarrow^{\mathrlap{p}} &\phantom{AAA}&& \uparrow^{\mathrlap{p^\ast}} && \uparrow^{\mathrlap{id}} && \uparrow^{\mathrlap{0}} \\ X/\mathfrak{g} &&& CE(X/\mathfrak{g}) &=& C^\infty(X) &\oplus& C^\infty(X) \otimes \wedge^\bullet \mathfrak{g}^\ast }

from the manifold $X$ regarded as a Lie algebroid by example to the action Lie algebroid $X/\mathfrak{g}$ (example ), which may be called the homotopy quotient coprojection map. The dual homomorphism of differential graded-commutative superalgebras is given simply by the identity on $C^\infty(X)$ and the zero map on $\mathfrak{g}^\ast$ .

Next regard the real line manifold $\mathbb{R}^1$ as a Lie algebroid by example . Then homomorphisms of Lie algebroids (def. ) of the form

S \;\colon\; X/\mathfrak{g} \longrightarrow \mathbb{R}^1 \,,

hence smooth functions on the Lie algebroid, are equivalently

ordinary smooth functions $S \;\colon\; X \longrightarrow \mathbb{R}^1$ on the underlying smooth manifold,
which are invariant under the Lie algebra action in that $R(-)(S) = 0$ .

In terms of the canonical homotopy quotient coprojection map $p$ (175) this says that a smooth function on $X$ extension extends to the action Lie algebroid precisely if it is invariant:

\array{ X &\overset{S}{\longrightarrow}& \mathbb{R}^1 \\ {}^{\mathllap{p}}\downarrow & \nearrow_{ \mathrlap{ \text{exists precisely if} \; S \; \text{is invariant} } } \\ X/\mathfrak{g} }

Proof

An $\mathbb{R}$ -algebra homomorphism

CE( X/\mathfrak{g} ) \overset{S^\ast}{\longleftarrow} C^\infty(\mathbb{R}^1)

is fixed by what it does to the canonical coordinate function $x$ on $\mathbb{R}^1$ , which is taken by $S^\ast$ to $S \in C^\infty(X) \hookrightarrow CE(X/\mathfrak{g})$ . For this to be a dg-algebra homomorphism it needs to respect the differentials on both sides. Since the differential on the right is trivial, the condition is that $0 = d_{CE} S = R(-)(f)$ :

\array{ \left\{ S \right\} &\overset{S^\ast}{\longleftarrow}& \left\{ x \right\} \\ {}^{\mathllap{d_{CE(X/\mathfrak{g})}}}\downarrow && \downarrow^{\mathrlap{d_{CE(\mathbb{R}^1)} = 0 } } \\ \left\{ R(-)(S) = 0 \right\} &\underset{S^\ast}{\longleftarrow}& \left\{ 0 \right\} }

Given a gauge invariant function, hence a function $S \colon X/\mathfrak{g} \to \mathbb{R}$ on a Lie algebroid (example ), its exterior derivative $d S$ should be a section of the cotangent bundle of the Lie algebroid. Moreover, if all field variations are infinitesimal (as in def. ) then it should in fact be a section of the infinitesimal neighbourhood (example ) of the zero section inside the cotangent bundle, the infinitesimal cotangent bundle $T^\ast_{inf}(X/\mathfrak{g})$ of the Lie algebroid (def. ebelow).

To motivate the definition below of infinitesimal cotangent bundle of a Lie algebroid recall from example that the algebra of functions on the infinitesimal cotangent bundle should be fiberwise the formal power series algebra in the linear functions. But a fiberwise linear function on a cotangent bundle is by definition a vector field. Finally observe that vector fields are equivalently derivations of smooth functions (prop. ). This leads to the following definition:

Definition

(infinitesimal cotangent Lie algebroid)

Let $\mathfrak{a}$ be a Lie ∞-algebroid (def. ) over some manifold $X$ . Then its infinitesimal cotangent bundle $T^\ast_{inf} \mathfrak{a}$ is the Lie ∞-algebroid over $X$ whose underlying graded module over $C^\infty(X)$ is the direct sum of the original module with the derivations of the graded algebra underlying $CE(\mathfrak{a})$ :

(T^\ast_{inf} \mathfrak{a})^\ast_\bullet \;\coloneqq\; \mathfrak{a}^\ast_\bullet \oplus Der(CE(\mathfrak{a}))_\bullet

with differential on the summand $\mathfrak{a}$ being the original differential and on $Der(CE(\mathfrak{a}))$ being the graded commutator with the differential $d_{CE(\mathfrak{a})}$ on $CE(\mathfrak{a})$ (which is itself a graded derivation of degree +1):

\array{ \mathllap{ d_{CE(T^\ast_{inf} \mathfrak{a})} } &\mathrlap{ \vert_{\mathfrak{a}^\ast} }& & \coloneqq & d_{CE(\mathfrak{a})} \\ \mathllap{ d_{CE(T^\ast_{inf} \mathfrak{a})} } & \mathrlap{ \vert_{Der(\mathfrak{a})} } & \phantom{ \vert_{Der(\mathfrak{a})} } & \coloneqq & [d_{CE(\mathfrak{a})},-] }

Just as for ordinary cotangent bundles (def. ) there is a canonical homomorphism of Lie algebroids (def. ) from the infinitesimal cotangent Lie algebroid down to the base Lie algebroid:

(176)

\array{ T^\ast_{inf} \mathfrak{a} &\phantom{AAA}&& CE(T^\ast_{inf} \mathfrak{g}) &=& CE(\mathfrak{a}) &\oplus& \wedge^{\bullet \geq 1}_{CE(\mathfrak{a})} Der(\mathfrak{a}) \\ \downarrow^{\mathrlap{cb}} &&& \uparrow^{\mathrlap{cb^\ast}} && \uparrow^{\mathrlap{id}} && \uparrow^{\mathrlap{0}} \\ \mathfrak{a} &&& CE(\mathfrak{a}) &=& CE(\mathfrak{a}) &\oplus& 0 }

given dually by the identity on the original generators.

Example

(infinitesimal cotangent bundle of action Lie algebroid)

Let $X/\mathfrak{g}$ be an action Lie algebroid (def. ) whose Chevalley-Eilenberg differential is given in local coordinates by (171)

d_{CE(X/\mathfrak{g})} \;=\; \tfrac{1}{2}\gamma^\alpha{}_{\beta \gamma} c^\beta c^\gamma \frac{\partial}{\partial c^\alpha} + c^\alpha R_a^\alpha \frac{\partial}{\partial \phi^a} \,.

Then its infinitesimal cotangent Lie algebroid $T^\ast_{inf} (X/\mathfrak{g})$ (def. ) has the generators

\array{ & \left( \frac{\partial}{\partial c^\alpha} \right) & \left( \phi^a \right) , \left( \frac{\partial}{\partial \phi^a} \right) & \left( c^\alpha \right) \\ deg = & -1 & 0 & +1 }

and we find that CE-differential on the new derivation generators is given by

(177)

\begin{aligned} d_{CE(T^\ast_{inf}(X/\mathfrak{g}))} \left( \frac{\partial}{\partial c^\alpha} \right) & \coloneqq \left[d_{CE(X/\mathfrak{g})}, \frac{\partial}{\partial c^\alpha} \right] \\ & = R_\alpha^a \frac{\partial}{\partial \phi^a} + \gamma^\beta{}_{\alpha \gamma} c^\gamma \frac{\partial}{\partial c^\beta} \end{aligned}

and

(178)

\begin{aligned} d_{CE(T^\ast_{inf}(X/\mathfrak{g}))} \left( \frac{\partial}{\partial \phi^a} \right) & \coloneqq \left[ d_{CE(X/\mathfrak{g})}, \frac{\partial}{\partial \phi^a} \right] \\ & = - c^\alpha \frac{\partial R_\alpha^b}{\partial \phi^a} \frac{\partial}{\partial \phi^b} \end{aligned} \,.

To amplify that the derivations on $CE(X/\mathfrak{g})$ , such as $\frac{\partial}{\partial \phi^a}$ and $\frac{\partial}{\partial c^\alpha}$ , are now coordinate functions in $CE(T^\ast_{inf}(X/\mathfrak{g}))$ one writes them as

(179)

\phi^\ddagger_a \;\coloneqq\; \frac{\partial}{\partial \phi^a} \phantom{AAAAA} c\ddagger_\alpha \;\coloneqq\; \frac{\partial}{\partial c^\alpha} \,.

so that the generator content then reads as follows:

(180)

\array{ & \left( c^\ddagger_\alpha \right) & \left( \phi^a \right) , \left( \phi^\ddagger_a \right) & \left( c^\alpha \right) \\ deg = & -1 & 0 & +1 } \,.

In this notation the full action of the CE-differential for $T^\ast_{inf}(X/\mathfrak{g})$ is therefore the following:

(181)

\array{ & d_{CE(T^\ast_{inf}(X/\mathfrak{g}))} \\ \phi^a &\mapsto& c^\alpha R^a_\alpha \\ c^\alpha & \mapsto& \tfrac{1}{2} \gamma^\alpha{}_{\beta \gamma} c^\beta c^\gamma \\ \phi^\ddagger_a &\mapsto& - c^\alpha \frac{\partial R_\alpha^b}{\partial \phi^a} \phi^\ddagger_b \\ c^\ddagger_\alpha &\mapsto& R_\alpha^a \phi^\ddagger_a + \gamma^\beta{}_{\alpha \gamma} c^\gamma c^\ddagger_\beta }

With a concept of cotangent bundles for Lie algebroids in hand, we want to see next that their sections are differential 1-forms on a Lie algebroid in an appropriate sense:

Proposition

(exterior differential of invariant function is section of infinitesimal cotangent bundle)

For $\mathfrak{a}$ a Lie ∞-algebroid (def. ) over some $X$ ; and $S \;\colon\;\mathfrak{a} \longrightarrow \mathbb{R}$ a invariant smooth function on it (example ) there is an induced section $d S$ of the infinitesimal cotangent Lie algebroid (def. ) bundle projection (176):

\array{ && T^\ast_{inf} \mathfrak{a} \\ & {}^{\mathllap{d S}}\nearrow & \downarrow^{\mathrlap{cb}} \\ \mathfrak{a} &=& \mathfrak{a} } \,,

given dually by the homomorphism of differential graded-commutative superalgebras

(d S)^\ast \;\colon\; CE(T^\ast_{inf} \mathfrak{a}) \longrightarrow CE(\mathfrak{a})

which sends

the generators in $\mathfrak{a}^\ast$ to themselves;
a vector field $v$ on $X$ , regarded as a degree-0 derivation to $d S(v) = v(S) \in C^\infty(X)$ ;
all other derivations to zero.

Proof

We discuss the proof in the special case that $\mathfrak{a} = X/\mathfrak{g}$ is an action Lie algebroid (def. ) hence where $T^\ast_{inf}(\mathfrak{a}) = T^\ast_{inf}(X/\mathfrak{g})$ is as in example . The general case is directly analogous.

Since $(d S)^\ast$ has been defined on generators, it is uniquely a homomorphism of graded algebras. It is clear that if $(d S)^\ast$ is indeed a homomorphism of differential graded-commutative superalgebras in that it also respects the CE-differentials, then it yields a section as claimed, because by definition it is the identity on $\mathfrak{a}^\ast$ . Hence all we need to check is that $(d S)^\ast$ indeed respects the CE-differentials.

On the original generators in $\mathfrak{a}^\ast$ this is immediate, since on these the CE-differential on both sides are by definition the same.

On the derivation $\phi^\ddagger_a \coloneqq \frac{\partial}{ \partial \phi^a}$ we find from (178)

\array{ \left\{ \frac{\partial S}{\partial \phi^a} \right\} &\overset{(d S)^\ast}{\longleftarrow}& \left\{ \phi^\ddagger_a \right\} \\ {}^{\mathllap{d_{CE(X/\mathfrak{g})}}}\downarrow && \downarrow^{\mathrlap{d_{CE(T^\ast_{inf} (X/\mathfrak{g}))}}} \\ \left\{ -c^\alpha \frac{\partial R_\alpha^b}{\partial \phi^a} \frac{\partial S}{\partial \phi^b} \right\} &\underset{(d S)^\ast}{\longleftarrow}& \left\{ -c^\alpha \frac{\partial R_\alpha^b}{\partial \phi^a} \phi^\ddagger_b \right\} }

Notice that the left vertical map is indeed as shown, due to the invariance of $S$ (example ), which allows an “integration by parts”:

\begin{aligned} d_{CE(X/\mathfrak{g})}\left( \frac{\partial S}{\partial \phi_a} \right) & = c^\alpha R_\alpha^{b} \frac{\partial}{\partial \phi^b} \frac{\partial}{\partial \phi^a} S \\ & = \frac{\partial}{\partial \phi^a} \left( c^\alpha \underset{ = 0 }{ \underbrace{ R_\alpha^b \frac{\partial S}{\partial \phi^b} } } \right) \;-\; c^\alpha \frac{\partial R_\alpha^b}{\partial \phi^a} \frac{\partial S}{\partial \phi^b} \end{aligned}

Similarly, on the derivation $c^\ddagger_\alpha \coloneqq \frac{\partial}{\partial c^\alpha}$ we find from (177) and using the invariance of $S$ (example )

\array{ \left\{ 0 \right\} &\overset{(d S)^\ast}{\longleftarrow}& \left\{ c^\ddagger_\alpha \right\} \\ {}^{\mathllap{d_{CE(X/\mathfrak{g})}}}\downarrow && \downarrow^{\mathrlap{d_{CE(T^\ast_{inf}(X/\mathfrak{g}))}}} \\ \left\{ 0 = R_\alpha^a \frac{\partial S}{\partial \phi^a} \right\} &\underset{(d S)^\ast}{\longleftarrow}& \left\{ R_\alpha^a \phi^\ddagger_a + \gamma^\beta{}_{\alpha \gamma} c^\gamma c^\ddagger_\alpha \right\} } \,.

This shows that the differentials are being respected.

Next we describe the vanishing locus of $d S$ , hence the critical locus of $S$ . Notice that if $d S$ is regarded as an ordinary differential 1-form on an ordinary smooth manifold $X$ , then its ordinary vanishing locus

X_{d S = 0} \;=\; \left\{ x \in X \;\vert\; d S(x) = 0 \right\}

is simply the fiber product of $d S$ with the zero section of the cotangent bundle, hence the universal space that makes the following diagram commute:

\array{ X_{d S = 0} &\overset{\phantom{AAA}}{\hookrightarrow}& X \\ \downarrow && \downarrow^{\mathrlap{0}} \\ X &\underset{d S}{\longrightarrow}& T^\ast_{inf} X } \,.

This is just the general abstract way to express the equation $d S = 0$ .

In this general abstract form the concept of critical locus generalizes to invariant functions on super Lie algebroids, where the vanishing of $d S$ is regarded only up to homotopy, namely up to infinitesimal symmetry transformations by the Lie algebra $\mathfrak{g}$ . In this homotopy-theoretic refinement we speak of the derived critical locus. The following definition simply states what this comes down to in components. For a detailed derivation see at derived critical locus and for general introduction to higher differential geometry and higher Lie theory see at Higher structures in Physics.

Definition

(derived critical locus of invariant function on Lie ∞-algebroid)

Let $\mathfrak{a}$ be a Lie ∞-algebroid (def. ) over some $X$ , let

S \;\colon\; \mathfrak{a} \longrightarrow \mathbb{R}

be an invariant function (example ) and consider the section of its infinitesimal cotangent bundle $T^\ast_{inf} \mathfrak{a}$ (def. ) corresponding to its exterior derivative via prop. :

\array{ \mathfrak{a} && \overset{d S}{\longrightarrow} && T^\ast_{inf} \mathfrak{a} \\ & {}_{\mathllap{id}}\searrow && \swarrow_{\mathrlap{cb}} \\ && \mathfrak{a} }

Then the derived critical locus of $S$ is the derived Lie algebroid (def. ) to be denoted $\mathfrak{a}_{d S \simeq 0}$ which is the homotopy pullback of the section $d S$ along the zero section:

\array{ \mathfrak{a}_{d S \simeq 0} &\longrightarrow& \mathfrak{a} \\ \downarrow &(pb)& \downarrow^{\mathrlap{0}} \\ \mathfrak{a} &\underset{d S}{\longrightarrow}& T^\ast_{inf} \mathfrak{a} } \,.

This means equivalently (details are at derived critical locus) that the Chevalley-Eilenberg algebra of $\mathfrak{a}_{d S \simeq 0}$ is like that of the infinitesimal cotangent Lie algebroid $T^\ast_{inf} \mathfrak{a}$ (def. ) except for two changes:

all derivations are shifted down in degree by one;

rephrased in terms of graded manifold (remark ) this means that the graded manifold underlying $\mathfrak{a}_{d S \simeq 0}$ is $T^\ast_{inf}[-1]\mathfrak{a}$ ;
the Chevalley-Eilenberg differential on the derivations coming from tangent vector fields $v$ on $X$ is that of the infinitesimal cotangent Lie algebroid $T^\ast_{inf} \mathfrak{a}$ plus $d S(v) = v(S)$ .

We now make the general concept of derived critical locus inside an L-∞ algebroid (def. ) explicit in our running example of an action Lie algebroid; the reader not concerned with the general idea of homotopy pullbacks may consider the following example as the definition of derived critical locus for the purposes of our running examples:

Example

(derived critical locus inside action Lie algebroid)

Consider an invariant function (def. ) on an action Lie algebroid (def. )

S \;\colon\; X/\mathfrak{g} \overset{\phantom{AAA}}{\longrightarrow} \mathbb{R}

for the case that the underlying supermanifold $X$ is a super Cartesian space (def. ) with global coordinates $(\phi^a)$ as in example . Then the derived critical locus (def. )

(X/\mathfrak{g})_{d S \simeq 0}

is, in terms of its Chevalley-Eilenberg algebra $CE\left( (X/\mathfrak{g})_{d S \simeq 0} \right)$ (def. ) given as follows:

Its generators are those of $CE\left( T^\ast_{inf}(X/\mathfrak{g}) \right)$ as in (180), except for a shift of degree of the derivation-generators down by one:

\array{ & \left( c^\ddagger_{\alpha} \right) & \left( \phi^\ddagger_a \right) & \left( \phi^a \right) & \left( c^\alpha \right) \\ deg = & -2 & -1 & 0 & +1 }

Rephrased in terms of graded manifold (remark ) this means that the graded manifold underlying the derived critical locus is the shifted infinitesimal cotangent bundle of the graded manifold $\mathfrak{g}[1] \times X$ (170) which underlies the action Lie algebroid (def. ):

(182)

(X/\mathfrak{g})_{d S \simeq 0} \;=_{grmfd}\; T^\ast_{inf}[-1]\left( \mathfrak{g}[1] \times X \right)

and if $X = \mathbb{R}^{b\vert s}$ is a super Cartesian space this becomes more specifically

\begin{aligned} (\mathbb{R}^{p \vert q}/\mathfrak{g})_{d S \simeq 0} & =_{grmfd} T^\ast_{inf}[-1]\left( \mathfrak{g}[1] \times \mathbb{R}^{p \vert q} \right) \\ & =_{\phantom{grmfd}} \underset{ (c^\alpha) }{ \underbrace{ \mathfrak{g}[1] }} \times \underset{ (\phi^a) }{ \underbrace{ \mathbb{R}^{p\vert q} }} \times \underset{ (\phi^\ddagger_a) }{ \underbrace{ (\mathbb{R}^{p \vert q})^\ast_{inf}[-1] }} \times \underset{ (c^\ddagger_\alpha) }{ \underbrace{ \mathfrak{g}^\ast[-2] }} \end{aligned}

Moreover, on these generators the CE-differential is given by

(183)

\array{ & d_{CE\left((X/\mathfrak{g})_{d S \simeq 0}\right)} \\ \phi^a &\mapsto& c^\alpha R^a_\alpha \\ c^\alpha & \mapsto& \tfrac{1}{2} \gamma^\alpha{}_{\beta \gamma} c^\beta c^\gamma \\ \phi^\ddagger_a &\mapsto& \underset{ new }{ \underbrace{ \frac{\partial S}{\partial \phi^a} }} - c^\alpha \frac{\partial R_\alpha^b}{\partial \phi^a} \phi^\ddagger_b \\ c^\ddagger_\alpha &\mapsto& R_\alpha^a \phi^\ddagger_a + \gamma^\beta{}_{\alpha \gamma} c^\gamma c^\ddagger_b }

which is just the expression for the differential (181) in $CE\left( T^\ast_{inf}(X/\mathfrak{g}) \right)$ from example , except for the fact that (the derivations are shifted down in degree and) the new term $\frac{\partial S}{\partial \phi^a}$ over the brace.

The following example illustrates how the concept of derived critical locus $X_{d S \simeq 0}$ of $S$ is a homotopy theoretic version of the ordinary concept of critical locus $X_{d S = 0}$ :

Example

(ordinary critical locus is cochain cohomology of derived critical locus in degree 0)

Let $X$ be an superpoint (def. ) or more generally the infinitesimal neighbourhood (example ) of a point in a super Cartesian space (def. ) with coordinate functions $(\phi^a)$ , so that its algebra of functions $C^\infty(X)$ is a truncated polynomial algebra or formal power series algebra in the variables $\phi^a$ .

Consider for simplicity the special case that $\mathfrak{g} = 0$ so that there is no Lie algebra action on $X$ .

Then the Chevalley-Eilenberg algebra of the derived critical locus $X_{d S \simeq 0}$ of $S$ (example ) has generators

\begin{aligned} & & \left( \phi^\ddagger_a \right) & \left( \phi^a \right) & \\ deg = & & -1 & 0 & \end{aligned}

and differential given by

\array{ & d_{CE\left( X_{d S \simeq 0} \right)} \\ \phi^a &\mapsto& 0 \\ \phi^\ddagger_a &\mapsto& \frac{\partial S}{\partial \phi^a} } \,.

Hence the cochain cohomology of the Chevalley-Eilenberg algebra of the derived critical locus indegree 0 is the quotient of $C^\infty(X)$ by the ideal which is generated by $\left( \frac{\partial S}{\partial \phi^a} \right)$

H^0\left( CE\left( X_{d S \simeq 0} \right) \right) \;=\; C^\infty(X)/\left( \frac{\partial S}{\partial \phi^a} \right) \,.

But under the assumption that $X$ is a superpoint or infinitesimal neighbourhood of a point, this quotient algebra is just the algebra of functions on the ordinary critical locus $X_{d S = 0}$ .

(The quotient says that every function on $X$ which vanishes where $\frac{\partial S}{\partial \phi^a}$ vanishes is zero in the quotient. This means that the quotient algebra consists of the functions on $X$ modulo the equivalence relation that identifies two if they agree on the critical locus $X_{d S = 0}$ , which is the functions on $X_{d S = 0}$ .)

Hence the derived critical locus yields the ordinary critical locus in cochain cohomology:

H^0\left( CE\left( X_{d S \simeq 0} \right) \right) \;\simeq\; C^\infty\left( X_{d S = 0} \right) \,.

However, it is not in general the case that the derived critical locus is a resolution of the ordinary critical locus, in that all its cohomology in negative degree vanishes. Instead, the cohomology of the Chevalley-Eilenberg algebra of a derived critical locus in negative degree detects Lie algebra action and more generally L-∞ algebra action on $X$ under which $S$ is invariant. If this action is incorporated into $X$ by passing to the action Lie algebroid $X/\mathfrak{g}$ and then forming the derived critical locus $(X/\mathfrak{g})_{d S \simeq 0}$ in there, as in example .

This issue we discuss in detail in the chapter Gauge fixing, see prop. below.

In order to generalize the statement of example to the case that a Lie algebra action is taken into account, we need to realize the Chevalley-Eilenberg algebra of a derived critical locus in a Lie algebroid is the total complex of a double complex:

Proposition

(Chevalley-Eilenberg algebra of derived critical locus is total complex of BV-BRST bicomplex)

Let $(X/\mathfrak{g})_{d S \simeq 0}$ be a derived critical locus inside an action Lie algebroid as in example . Then its Chevalley-Eilenberg differential (183) may be decomposed as the sum of two anti-commuting differential

d_{CE\left( (X/\mathfrak{g})_{d S \simeq 0} \right)} \;=\; s_{BRST} + s_{BS}

which are defined on the generators of the Chevalley-Eilenberg algebra as follows:

(184)

\array{ & s_{BV} \\ \phi^a &\mapsto& 0 \\ c^\alpha & \mapsto& 0 \\ \phi^\ddagger_a &\mapsto& \frac{\partial S}{\partial \phi^a} \\ c^\ddagger_\alpha &\mapsto& R_\alpha^a \phi^\ddagger_a \\ \phantom{A} \\ & s_{BRST} \\ \phi^a &\mapsto& c^\alpha R^a_\alpha \\ c^\alpha & \mapsto& \tfrac{1}{2} \gamma^\alpha{}_{\beta \gamma} c^\beta c^\gamma \\ \phi^\ddagger_a &\mapsto& - c^\alpha \frac{\partial R_\alpha^b}{\partial \phi^a} \phi^\ddagger_b \\ c^\ddagger_\alpha &\mapsto& \gamma^\beta{}_{\alpha \gamma} c^\gamma c^\ddagger_b }

If we moreover decompose the degree of the generators into two degrees

\array{ & \left( c^\ddagger_{\alpha} \right) & \left( \phi^\ddagger_a \right) & \left( \phi^a \right) & \left( c^\alpha \right) \\ deg_{gh} = & 0 & 0 & 0 & +1 \\ deg_{af} = & -2 & -1 & 0 & 0 }

then these two differentials constitute a bicomplex

\array{ CE^{0,0}\left( (X/\mathfrak{g})_{d S \simeq 0}\right) &\overset{s_{BRST}}{\longrightarrow}& CE^{1,0}\left( (X/\mathfrak{g})_{d S \simeq 0}\right) &\overset{s_{BRST}}{\longrightarrow}& CE^{2,0}\left( (X/\mathfrak{g})_{d S \simeq 0}\right) &\overset{s_{BRST}}{\longrightarrow}& \cdots \\ \big\uparrow^{\mathrlap{s_{BV}}} && \big\uparrow^{\mathrlap{s_{BV}}} && \big\uparrow^{\mathrlap{s_{BV}}} && \\ CE^{0,-1}\left( (X/\mathfrak{g})_{d S \simeq 0}\right) &\overset{s_{BRST}}{\longrightarrow}& CE^{1,-1}\left( (X/\mathfrak{g})_{d S \simeq 0}\right) &\overset{s_{BRST}}{\longrightarrow}& CE^{2,-1}\left( (X/\mathfrak{g})_{d S \simeq 0}\right) &\overset{s_{BRST}}{\longrightarrow}& \cdots \\ \big\uparrow^{\mathrlap{s_{BV}}} && \big\uparrow^{\mathrlap{s_{BV}}} && \big\uparrow^{\mathrlap{s_{BV}}} && \\ CE^{0,-2}\left( (X/\mathfrak{g})_{d S \simeq 0}\right) &\overset{s_{BRST}}{\longrightarrow}& CE^{1,-2}\left( (X/\mathfrak{g})_{d S \simeq 0}\right) &\overset{s_{BRST}}{\longrightarrow}& CE^{2,-2}\left( (X/\mathfrak{g})_{d S \simeq 0}\right) &\overset{s_{BRST}}{\longrightarrow}& \cdots \\ \big\uparrow^{\mathrlap{s_{BV}}} && \big\uparrow^{\mathrlap{s_{BV}}} && \big\uparrow^{\mathrlap{s_{BV}}} && \\ \vdots && \vdots && \vdots }

whose total complex is the Chevalley-Eilenberg dg-algebra of the derived critical locus

\begin{aligned} CE\left( (X/\mathfrak{g})_{d S \simeq 0} \right) & = \underset{ gh, af }{\bigoplus} CE^{gh,af}\left( (X/\mathfrak{g})_{d S \simeq 0} \right) \\ d_CE\left( (X/\mathfrak{g})_{d S \simeq 0} \right){} & = s_{BV} + s_{BRST} \end{aligned} \,.

Proof

It is clear from the definition that the graded derivations $s_{BV}$ and $s_{BRST}$ have (i.e. increase) bidegree as follows:

\array{ & s_{BRST} & s_{BV} \\ deg_{gh} = & +1 & 0 \\ deg_{af} = & 0 & +1 } \,.

This implies that in

\begin{aligned} 0 & = \left( d_{CE\left( (X/\mathfrak{g})_{d S \simeq 0} \right)} \right)^2 \\ & = \left( s_{BV} + s_{BRST}\right)^2 \\ & = \underset{ = 0 }{ \underbrace{ \left( s_{BV}\right)^2 }} + \underset{ = 0 }{ \underbrace{ \left( s_{BRST} \right)^2 }} + \underset{ = 0 }{ \underbrace{ \left[ s_{BV}, s_{BRST} \right] } } \end{aligned}

all three terms have to vanish separately, as shown, since they each have different bidegree (the last term denotes the graded commutator, hence the anticommutator). This is the statement to be proven.

Notice that the nilpotency of $s_{BV}$ is also immediately checked explicitly, due to the invariance of $S$ (example ):

\begin{aligned} s_{BV} \left( s_{BV} \left( c^\ddagger_\alpha \right) \right) & = s_BV\left( R_\alpha^a \phi^\ddagger_a \right) \\ & = R_\alpha^a \frac{\partial S}{\partial \phi^a} \\ & = 0 \end{aligned}

As a corollary of prop. \refDerivedCriticalLocusOfActionLiAlgebroidBicomplexStructure{} we obtain the generalization of example to non-trivial $\mathfrak{g}$ -actions:

Proposition

(cochain cohomology of BV-BRST complex in degree 0 is the invariant function on the critical locus)

Let $(X/\mathfrak{g})_{d S \simeq 0}$ be a derived critical locus inside an action Lie algebroid as in example .

Then if the vertical differential (prop. )

\array{ CE^{\bullet, \bullet+1}\left( (X/\mathfrak{g})_{d S \simeq 0} \right) \\ \uparrow^{\mathrlap{s_{BV}}} \\ CE^{\bullet, \bullet}\left( (X/\mathfrak{g})_{d S \simeq 0} \right) }

has vanishing cochain cohomology in negative $af$ -degree

(185)

H^{\bullet \leq 1}(s_{BV}) = 0

then the cochain cohomology of the full Chevalley-Eilenberg dg-algebra is given by the cochain cohomology of $s_{BRST}$ on $H^0(s_{BV})$ :

H^k\left( CE\left( (X/\mathfrak{g})_{d S \simeq 0} \right) \right) \;\simeq\; H^k\left( H^0(s_{BV}), s_{BRST} \right) \,.

Moreover if $X$ is inside the infinitesimal neighbourhood of a point as in example then the full cochain cohomology in degree 0 is the space of those functions on the ordinary critical locus $X_{d S = 0}$ which are $\mathfrak{g}$ -invariant:

H^0 \left( CE\left( (X/\mathfrak{g})_{d S \simeq 0} \right) \right) \;=\; \left\{ X_{d S = 0} \overset{f}{\to} \mathbb{R} \;\vert\; \left(R_\alpha^a \frac{\partial f}{\partial \phi^a} = 0\right) \right\}

Proof

The first statement follows from the spectral sequence of the double complex

H^{gh} \left( H^{af} \left( CE\left( (X/\mathfrak{g})_{d S \simeq 0} \right) \right) \right) \;\Rightarrow\; H^{gh + af}\left( CE\left( (X/\mathfrak{g})_{d S \simeq 0} \right) \right) \,.

Under the given assumption the second page of this spectral sequence is concentrated on the row $af = 0$ . This implies that all differentials on this page vanish, so that the sequence collapses on this page. Moreover, since the spectral sequence consists of vector spaces (modules over the real numbers) the extension problem is trivial, and hence the claim follows.

Now if $X$ is inside the infinitesimal neighbourhood of a point, then example says that $H^0(s_{BV})$ in $deg_{gh} = 0$ consists of the functions on the ordinary critical locus and hence the abvove result implies that

\begin{aligned} H^0\left( CE\left( (X/\mathfrak{g})_{d S \simeq 0}\right) \right) & = ker(s_{BRST})\vert_{C^\infty\left( X_{d S = 0} \right) } \,/\, \underset{= 0}{ \underbrace{ im(s_{BRST})\vert_{C^\infty\left( X_{d S = 0} \right)} } } \\ & = ker(s_{BRST})\vert_{C^\infty\left( X_{d S = 0} \right) } \\ & = \left\{ X_{d S = 0} \overset{f}{\longrightarrow} \mathbb{R} \,\vert\, \left( R_\alpha^a \frac{\partial S}{\partial \phi^a} = 0 \right) \right\} \end{aligned}

This means that under condition (185) the construction of a derived critical locus inside an action Lie algebroid provides a resolution of the space of those functions which are

restricted to the critical locus (a homotopy intersection);
invariant under the Lie algebra action (a homotopy quotient).

We apply this general mechanism below to Lagrangian field theory, where it serves to provide a resolution by the BV-BRST complex of the space of observables which are

on-shell,
gauge invariant.

But in order to control this application, we first establish the tool of the Schouten bracket/antibracket.

$\,$

Schouten bracket/antibracket

Since the infinitesimal cotangent Lie algebroid $T^\ast_{inf} \mathfrak{a}$ has function algebra given by tensor products of tangent vector fields/derivations, we expect that a graded analogue of the Lie bracket of ordinary tangent vector fields exists on the Chevalley-Eilenberg algebra $CE\left( T^\ast_{inf} \mathfrak{a}\right)$ . This is indeed the case, and crucial for the theory:

Definition

(Schouten bracket and antibracket for action Lie algebroid)

Consider a derived critical locus $(X/\mathfrak{g})_{d S \simeq 0}$ inside an action Lie algebroid $X/\mathfrak{g}$ as in example .

Then the graded commutator of graded derivations of the Chevalley-Eilenberg algebra of $X/\mathfrak{g}$

[-,-] \;\colon\; Der(CE(X/\mathfrak{g})) \otimes Der(CE(X/\mathfrak{g})) \longrightarrow Der(CE(X/\mathfrak{g}))

uniquely extends, by the graded Leibniz rule, to a graded bracket of degree $(1,even)$ on the CE-algebra of the derived critical locus $(X/\mathfrak{g})_{d S \simeq 0}$

\left\{ -,-\right\} \;\colon\; CE\left( (X/\mathfrak{g})_{d S \simeq 0} \right) \otimes C\left( (X/\mathfrak{g})_{d S \simeq 0} \right) \longrightarrow CE\left( (X/\mathfrak{g})_{d S \simeq 0} \right)

such that this is a graded derivation in both arguments.

This is called the Schouten bracket.

There is an elegant way to rewrite this in terms of components: With the notation (179) for the coordinate-derivations the Schouten bracket is equivalently given by

(186)

\begin{aligned} \left\{ f,g \right\} & = \phantom{+} \frac{\overset{\leftarrow}{\partial} f}{\partial \phi^\ddagger_a} \frac{\overset{\rightarrow}{\partial} g}{\partial {\phi}^a} - \frac{\overset{\leftarrow}{\partial} f}{\partial \phi^a} \frac{\overset{\rightarrow}{\partial} g}{\partial \phi^\ddagger_a} \\ & \phantom{=} + \frac{\overset{\leftarrow}{\partial} f}{\partial c^\ddagger_\alpha} \frac{\overset{\rightarrow}{\partial} g}{\partial {c}^{\alpha}} - \frac{\overset{\leftarrow}{\partial} f}{\partial c^{\alpha}} \frac{\overset{\rightarrow}{\partial} g}{\partial c^\ddagger_\alpha} \end{aligned} \,,

where the arrow over the partial derivative indicates that we we pick up signs via the Leibniz rule either as usual, going through products from left to right (for $\overset{\rightarrow}{\partial}$ ) or by going through the products from right to left (for $\overset{\leftarrow}{\partial}$ ).

In this form the Schouten bracket is called the antibracket.

(e. g. Henneaux 90, (53d), Henneaux-Teitelboim 92, section 15.5.2)

The power of the Schouten bracket/antibracket rests in the fact that it makes the Chevalley-Eilenberg differential on a derived critical locus $(X/\mathfrak{g})_{d S \simeq 0}$ become a Hamiltonian vector field, for “Hamiltonian” the sum of $S$ with the Chevalley-Eilenberg differential of $X/\mathfrak{g}$ :

Example

(Chevalley-Eilenberg differential of derived critical locus is Hamiltonian vector field for the Schouten bracket/antibracket)

Let $(X/\mathfrak{g})_{d S \simeq 0}$ be a derived critical locus inside an action Lie algebroid as in example .

Then the CE-differential (183) of the derived critical locus $X/\mathfrak{g}\vert_{S \simeq 0}$ is simply the Schouten bracket/antibracket (def. ) with the sum

(187)

S_{\text{BV-BRST}} \;\coloneqq\; S - d_{CE(X/\mathfrak{g})}

of the Chevalley-Eilenberg differential of $X/\mathfrak{g}$ and the function $-S$ :

d_{CE\left( (X/\mathfrak{g})_{d S \simeq 0} \right) }(-) \;=\; \left\{ - S + d_{CE(X/\mathfrak{g})} \,,\, (-) \right\} \,.

In coordinates, using the expression for $d_{CE(X/\mathfrak{g})}$ from (171) and using the notation for derivations from (179) this means that

d_{CE\left( (X/\mathfrak{g})_{d S \simeq 0} \right)}(-) \;=\; \left\{ - S + c^\alpha R_\alpha^a \phi^\ddagger_a - \tfrac{1}{2}\gamma^\alpha{}_{\beta \gamma} c^\beta c^\gamma c^\ddagger_\alpha \,,\, (-) \right\} \,.

Proof

This is a simple straightforward computation, but we spell it out for illustration of the general principle. The result is to be compared with (183):

for $\phi^a$ :

\begin{aligned} \left\{ - S + c^\alpha R_\alpha^{a'} \phi^\ddagger_{a'} - \tfrac{1}{2}\gamma^\alpha{}_{\beta \gamma} c^\beta c^\gamma c^\ddagger_\alpha \,,\, \phi^a \right\} & = \left\{ c^\alpha R_\alpha^{a'} \phi^\ddagger_{a'} \,,\, \phi^a \right\} \\ & = c^\alpha R_\alpha^{a'} \underset{ \delta_{a'}^a }{ \underbrace{ \left\{ \phi^\ddagger_{a'} \,,\, \phi^a \right\} } } \\ & = c^\alpha R_\alpha^{a} \end{aligned}

for $c^\alpha$ :

\begin{aligned} \left\{ - S + c^\alpha R_\alpha^{a} \phi^\ddagger_{a} - \tfrac{1}{2}\gamma^{\alpha'}{}_{\beta \gamma} c^\beta c^\gamma c^\ddagger_{\alpha'} \,,\, c^\alpha \right\} & = \left\{ \tfrac{1}{2}\gamma^{\alpha'}{}_{\beta \gamma} c^\beta c^\gamma c^\ddagger_{\alpha'} \,,\, c^\alpha \right\} \\ & = \tfrac{1}{2}\gamma^{\alpha'}{}_{\beta \gamma} c^\beta c^\gamma \underset{ \delta_{\alpha'}^\alpha }{ \underbrace{ \left\{ c^\ddagger_{\alpha'} \,,\, c^\alpha \right\} } } \\ & = \tfrac{1}{2}\gamma^{\alpha}{}_{\beta \gamma} c^\beta c^\gamma \end{aligned}

for $\phi^\ddagger_a$ :

\begin{aligned} \left\{ - S + c^\alpha R_\alpha^{a'} \phi^\ddagger_{a'} - \tfrac{1}{2}\gamma^{\alpha}{}_{\beta \gamma} c^\beta c^\gamma c^\ddagger_{\alpha} \,,\, \phi^\ddagger_a \right\} & = - \underset{ = -\frac{\partial S}{\partial \phi^a} }{ \underbrace{ \left\{ S \,,\, \phi^{\ddagger}_a \right\} } } + \left\{ c^\alpha R_\alpha^{a'} \phi^\ddagger_{a'} \,,\, \phi^\ddagger_a \right\} \\ & = \frac{\partial S}{\partial \phi^a} + c^\alpha \underset{ = -\frac{\partial R_\alpha^{a'}}{\partial \phi^a} }{ \underbrace{ \left\{ R_\alpha^{a'} \,,\, \phi^\ddagger_a \right\} } } \phi^\ddagger_{a'} \\ & = \frac{\partial S}{\partial \phi^a} - c^\alpha \frac{\partial R_\alpha^{a'}}{\partial \phi^a} \phi^\ddagger_{a'} \end{aligned}

for $c^\ddagger_\alpha$ :

\begin{aligned} \left\{ - S + c^{\alpha'} R_{\alpha'}^{a} \phi^\ddagger_{a} - \tfrac{1}{2}\gamma^{\alpha'}{}_{\beta \gamma} c^\beta c^\gamma c^\ddagger_{\alpha'} \,,\, c^\ddagger_\alpha \right\} & = \left\{ c^{\alpha'} R_{\alpha'}^a \phi^{\ddagger}_a \,,\, c^\ddagger_{\alpha} \right\} \;+\; \left\{ \tfrac{1}{2} \gamma^{\alpha'}{}_{\beta \gamma} c^\beta c^\gamma c^\ddagger_{\alpha'} \,,\, c^\ddagger_\alpha \right\} \\ & = \left\{ c^{\alpha'} \,,\, c^\ddagger_{\alpha} \right\} R_{\alpha'}^a \phi^{\ddagger}_a \;+\; \tfrac{1}{2} \gamma^{\alpha'}{}_{\beta \gamma} \underset{ = - c^\beta \delta_{\alpha}^\gamma + \delta_{\alpha}^\beta c^\gamma}{ \underbrace{ \left\{ c^\beta c^\gamma \,,\, c^\ddagger_\alpha \right\} }} c^\ddagger_{\alpha'} \\ & = R_\alpha^a \phi^\ddagger_{a} + \gamma^{\alpha'}{}_{\alpha \gamma} c^\gamma c^\ddagger_{\alpha'} \end{aligned}

Hence these values of the Schouten bracket/antibracket indeed all agree with the values of the CE-differential from (183).

As a corollary we obtain:

Proposition

(classical master equation)

Let $(X/\mathfrak{g})_{d S \simeq 0}$ be a derived critical locus inside an action Lie algebroid as in example .

Then the Schouten bracket/antibracket (def. ) of the function $S_{\text{BV-BRST}}$ S_{\text{BV-BRST}}

S_{\text{BV-BRST}} \;\coloneqq\; S - d_{CE\left( X/\mathfrak{g}\right)}

with itself vanishes:

\left\{ S_{\text{BV-BRST}} \,,\, S_{\text{BV-BRST}} \right\} \;=\; 0 \,.

Conversely, given a shifted cotangent bundle of the form $T^\ast[-1](X \times \mathfrak{g}[1])$ (182), then the struture of a differential of degree +1 on its algebra of functions is equivalent to a degree-0 element $S \in C^\infty(T^\ast[-1](X \times \mathfrak{g}[1]))$ such that

\left\{ S, S \right\} \;=\; 0 \,.

Since therefore this equation controls the structure of derived critical loci once the underlying manifold $X$ and Lie algebra $\mathfrak{g}$ is specified, it is also called the master equation and here specifically the classical master equation.

$\,$

This concludes our discussion of plain derived critical loci inside Lie algebroids. Now we turn to applying these considerations about to Lagrangian densities on a jet bundle, which are invariant under infinitesimal gauge symmetries generally only up to a total spacetime derivative. By example it is clear that this is best understood by first considering the refinement of the Schouten bracket/antibracket to this situation.

$\,$

local antibracket

If we think of the invariant function $S$ in def. as being the action functional (example ) of a Lagrangian field theory $(E,\mathbf{L})$ (def. ) over a compact spacetime $\Sigma$ , with $X$ the space of field histories (or rather an infinitesimal neighbourhood therein), hence with $\mathfrak{g}$ a Lie algebra of gauge symmetries acting on the field histories, then the Chevalley-Eilenberg algebra $CE\left((X/\mathfrak{g})_{d S \simeq 0}\right)$ of the derived critical locus of $S$ is called the BV-BRST complex of the theory.

In applications of interest, the spacetime $\Sigma$ is not compact. In that case one may still appeal to a construction on the space of field histories as in example by considering the action functional for all adiabatically switched $b \mathbf{L}$ Lagrangians, with $b \in C_{cp}^\infty(\Sigma)$ . This approach is taken in (Fredenhagen-Rejzner 11a).

Here we instead consider now the “local lift” or “de-transgression” of the above construction from the space of field histories to the jet bundle of the field bundle of the theory, refining the BV-BRST complex (prop. ) to the local BV-BRST complex (prop. below), corresponding to the local BRST complex from example (Barnich-Brandt-Henneaux 00).

This requires a slight refinement of the construction that leads to example : In contrast to the action functional $S = \tau_\Sigma(g\mathbf{L})$ (example ), the Lagrangian density $\mathbf{L}$ is not strictly invariant under infinitesimal gauge transformations, in general, rather it may change up to a horizontally exact term (by the very definition ). The same is then true, in general, for its Euler-Lagrange variational derivative $\delta_{EL} \mathbf{L}$ (unless we have already restricted to the shell, by prop. , which however here we do not explicitly, but only via passing to cochain cohomology as in example ).

This means that the Euler-Lagrange form $\delta_{EL} \mathbf{L}$ is, off-shell, not a section of the infinitesimal cotangent bundle (def. ) of the gauge action Lie algebroid on the jet bundle.

But it turns out that it still is a section of local refinement of the cotangent bundle, which is twisted by horizontally exact terms (prop. below). To see the required twist, it is most convenient to make use of a local version of the antibracket (def. below), via local refinement of example . As a result we may form the local derived critical locus as in def. but now with the invariance of the Lagrangian density only up to total spacetime derivatives taken into account. Its Chevalley-Eilenberg algebra is called the local BV-BRST complex (prop. below).

The following is the direct refinement of the concept of the underlying graded manifold of the infinitesimal cotangent bundle of an action Lie algebroid in example to the case where the base manifold is generalized to a field bundle (def. ) and the Lie algebra to a gauge parameter bundle (def. ):

Definition

(infinitesimal neighbourhood of zero section in cotangent bundle of fiber product of field bundle with shifted gauge parameter bundle)

Let $(E,\mathbf{L})$ be a Lagrangian field theory (def. ) over some spacetime $\Sigma$ , and let $\mathcal{G} \overset{gb}{\to} \Sigma$ be a bundle of gauge parameters (def. ) which are closed (def. ), inducing the Lie algebroid

E / ( \mathcal{G} \times_\Sigma T \Sigma ) \;=\; \left( J^\infty_\Sigma( E \times_\Sigma (\mathcal{G}[1]) ) , s_{BRST} ) \right)

whose Chevalley-Eilenberg algebra is the local BRST complex of the field theory (example ).

Then we write

T^\ast_{\Sigma,inf}\left( E \times_\Sigma (\mathcal{G}[1]) \right) \,, \phantom{AAA} T^\ast_{\Sigma,inf}[-1]\left( E \times_\Sigma (\mathcal{G}[1]) \right)

for, on the left, the infinitesimal neighbourhood of the zero section of the vertical cotangent bundle of the graded fiber product of the field bundle with the fiber-wise shifted gauge parameter bundle, as well as its shifted version on the right, as in (182).

In local coordinates this means the following: Assuming that the field bundle $E$ and the gauge parameter bundle $\mathcal{G}$ are trivial vector bundles (example ) with fiber coordinates $(\phi^a)$ and $(c^\alpha)$ , respectively, then $T^\ast_{\Sigma,inf}\left(E \times_\Sigma (\mathcal{G}[1])\right)$ is the trivial graded vector bundle with fiber coordinates

(188)

\array{ T^\ast_{\Sigma,inf}\left( E \times_\Sigma (\mathcal{G}[1]) \right) & \phantom{AAAAA}& T^\ast_{\Sigma,inf}[-1]\left( E \times_\Sigma (\mathcal{G}[1]) \right) \\ & \phantom{A} \\ \array{ & (c^\ddagger_\alpha), & (\phi^\ddagger_a),(\phi^a), & (c^\alpha) \\ deg = & -1 & 0 & 1 } & \phantom{AA}& \array{ & (c^\ddagger_\alpha), & (\phi^\ddagger_a)\, & (\phi^a), & (c^\alpha) \\ deg = & -2 & -1 & 0 & 1 } }

and such that smooth functions on $T^\ast_{\Sigma,inf}\left(E \times_\Sigma (\mathcal{G}[1])\right)$ are formal power series in $c^\ddagger_\alpha$ (necessarily due to degree reasons) and in $\phi^\ddagger_a$ (reflecting the infinitesimal neighbourhood of the zero section).

Here the shifted cotangents to the fields are called the antifields:

$\phi^\ddagger_a$ is antifield to the field $\phi^a$
$c^\ddagger_\alpha$ is antifield to the ghost field $c^\alpha$ .

The following is the direct refinement of the concept of the Schouten bracket on an action Lie algebroid from def. to the case where the base manifold is generalized to the jet bundle (def. ) field bundle (def. ) and the Lie algebra to the jet bundle of a gauge parameter bundle (def. ):

Definition

(local antibracket)

Let $(E,\mathbf{L})$ be a Lagrangian field theory (def. ) over Minkowski spacetime $\Sigma$ (def. ), and let $\mathcal{G} \overset{gb}{\to} \Sigma$ be a bundle of gauge parameters (def. ) which are closed (def. ), inducing via example the Lie algebroid

E / ( \mathcal{G} \times_\Sigma T \Sigma ) \;=\; \left( J^\infty_\Sigma( E \times_\Sigma (\mathcal{G}[1]) ) , s_{BRST} ) \right)

whose Chevalley-Eilenberg algebra is the local BRST complex of the field theory with shifted infinitesimal vertical cotangent bundle

(189)

E_{\text{BV-BRST}} \;\coloneqq\; T^\ast_{\Sigma,inf}[-1]\left( E \times_\Sigma (\mathcal{G}[1]) \right)

of its underlying graded bundle from def. .

Then on the horizontal $p+1$ -forms on this bundle (def. ) which in terms of the volume form may all be decomposed as (42)

H \;=\; h \, dvol_\Sigma \;\in\; \Omega^{p+1}_\Sigma\left( \,T^\ast_{\Sigma,inf}[-1]\left( E \times_\Sigma (\mathcal{G}[1]) \right) \, \right)

the local antibrackets

\{-,-\}' , \{-,-\} \;\colon\; \Omega^{p+1,0}_\Sigma( \, T^\ast_{\Sigma,inf}[-1](E \times_\Sigma \mathcal{G}[1]) \, ) \,\otimes\, \Omega^{p+1,0}_\Sigma( \, T^\ast_{\Sigma,inf}[-1](E \times_\Sigma \mathcal{G}[1]) \, ) \longrightarrow \Omega^{p+1,0}_\Sigma( \, T^\ast_{\Sigma,inf}[-1](E \times_\Sigma \mathcal{G}[1]) \, )

are the functions which are given in the local coordinates (188) as follows:

The first version is

\begin{aligned} \left\{ f\, dvol_\Sigma \,,\,g \, dvol_\Sigma \right\}' & \coloneqq \phantom{+} \left( \frac{\overset{\leftarrow}{\delta}_{EL} f }{\delta \phi^\ddagger_a} \frac{\overset{\rightarrow}{\delta}_{EL} g}{\delta {\phi^a}^{\phantom{\ddagger}}} - \frac{\overset{\leftarrow}{\delta}_{EL}}{\delta {\phi^a}^{\phantom{\ddagger}}} \frac{\overset{\rightarrow}{\delta}_{EL} g}{\delta \phi^\ddagger_a} \right) dvol_\Sigma \\ & \phantom{=} + \left( \frac{\overset{\leftarrow}{\delta}_{EL} f}{\delta c^\ddagger_\alpha} \frac{\overset{\rightarrow}{\delta}_{EL} g}{\delta {c^\alpha}^{\phantom{\ddagger}}} - \frac{\overset{\leftarrow}{\delta}_{EL}}{\delta {c^\alpha}^{\phantom{\ddagger}}} \frac{\overset{\rightarrow}{\delta}_{EL} g}{\delta c^\ddagger_\alpha} \right) dvol_\Sigma \,. \end{aligned}

This is of the form of the Schouten bracket (186) but with Euler-Lagrange derivatives (50) instead of partial derivatives,

The second version is this:

(190)

\begin{aligned} \left\{ f \, dvol_\Sigma, g \, dvol_\Sigma \right\} & \coloneqq \phantom{+} \left( \left( \frac{d^k}{d x^{\mu_1} \cdots d x^{\mu_k}} \left( \frac{\overset{\leftarrow}{\delta}_{EL} f}{\delta \phi^a} \right) \right) \left( \frac{\overset{\rightarrow}{\partial} g}{\partial {\phi}^\ddagger_{a,\mu_1 \cdots \mu_k}} \right) - \left( \frac{d^k}{d x^{\mu_1} \cdots d x^{\mu_k}} \left( \frac{\overset{\leftarrow}{\delta}_{EL} f}{\delta \phi^\ddagger_a} \right) \right) \left( \frac{\overset{\rightarrow}{\partial} g}{\partial \phi^a_{,\mu_1 \cdots \mu_k}} \right) \right) \, dvol_\Sigma \\ & \phantom{\coloneqq} + \left( \left( \frac{d^k}{d x^{\mu_1} \cdots d x^{\mu_k}} \left( \frac{\overset{\leftarrow}{\delta}_{EL} f}{\delta c^\alpha} \right) \right) \left( \frac{\overset{\rightarrow}{\partial} g}{\partial {c}^\ddagger_{\alpha,\mu_1 \cdots \mu_k}} \right) - \left( \frac{d^k}{d x^{\mu_1} \cdots d x^{\mu_k}} \left( \frac{\overset{\leftarrow}{\delta}_{EL} f}{\delta c^\ddagger_\alpha} \right) \right) \left( \frac{\overset{\rightarrow}{\partial} g}{\partial c^\alpha_{,\mu_1 \cdots \mu_k}} \right) \right) \, dvol_\Sigma \end{aligned}

where again $\frac{\delta_{EL}}{\delta \phi^a}$ denotes the Euler-Lagrange variational derivative (50)

(Barnich-Henneaux 96 (2.9) and (2.12), reviewed in Barnich 10 (4.9))

Proposition

(basic properties of the local antibracket)

The local antibracket from def. satisfies the following properties:

The two versions differ by a total spacetime derivative (def. ):

$\{f,g\} = \{f,g\}' + d(...) \,.$
The primed version is strictly graded skew-symmetric:

$\left\{f \, dvol_\Sigma \,,\, g\, dvol_\Sigma \right\}' \;=\; - (-1)^{deg(f) deg(g)} \, \left\{g \, dvol_\Sigma \,,\, f\, dvol_\Sigma \right\}$
The unprimed version $\{-,-\}$ strictly satisfies the graded Jacobi identity; in that it is a graded derivation in the second argument, of degree one more than the degree of the first argument:

(191) $\left\{ f\, dvol_\Sigma, \left\{ g\, dvol_\Sigma \,,\, h\, dvol_\Sigma \right\}\right\} \;=\; \underset{ = \left\{ \left\{ f\, dvol_\Sigma \,,\, g\, dvol_\Sigma \right\}' \,, h\, dvol_\Sigma \right\} }{ \underbrace{ \left\{ \left\{ f\, dvol_\Sigma \,,\, g\, dvol_\Sigma \right\} \,,\, h\, dvol_\Sigma \right\} } } \;+\; (-1)^{(deg(f)+1) deg(g)} \left\{ g\, dvol_\Sigma \,,\, \left\{ f\, dvol_\Sigma \,,\, h\, dvol_\Sigma \right\} \right\}$

and the first term on the right is equivalently given by the primed bracket, as shown under the brace;
the horizontally exact horizontal differential forms are an ideal for either bracket, in that for $f dvol_\Sigma = d(\cdots)$ or $g dvol_\Sigma = d(\cdots)$ we have

$\{ f dvol_\Sigma, g \, dvol_\Sigma \}' = 0 \phantom{AAA} \{ f dvol_\Sigma, g \, dvol_\Sigma \} = d(\cdots)$

for all $f$ , $g$ of homogeneous degree $deg(f)$ and $deg(g)$ , respectively.

(Barnich-Henneaux 96 (B.6) and footnote 9).

Proof

That the two expressions differ by a horizontally exact terms follows by the very definition of the Euler-Lagrange derivative (50). Also the graded skew symmetry of the primed bracket is manifest.

The third point requires some computation (Barnich-Henneaux 96 (B.9)).

Finally that $\{-,-\}'$ vanishes when at least one of its arguments is horizontally exact follows from the fact that already the Euler-Lagrange derivative vanishes on this argument (example ). This implies that $\{-,-\}$ is horizontally exact when at least one of its arguments is so, by the first item.

The following is the local refinement of prop. :

Remark

(local classical master equation)

The third item in prop. implies that the following conditions on a Lagrangian density $\mathbf{K} \in \Omega^{p+1}_\Sigma( T^\ast_{\Sigma,inf}( E \times_\Sigma \mathcal{G}[1] ) )$ whose degree is even

\mathbf{K} = K\, dvol_\Sigma \,, \phantom{AAA} deg(L) \in 2 \mathbb{Z}

are equivalent:

forming the local antibracket (def. ) with $\mathbf{K}$ is a differential

$\left(\left\{ \mathbf{K},-\right\}\right)^2 = 0 \,,$
the local antibracket (def. ) of $\mathbf{K}$ with itself is a total spacetime derivative:

$\left\{ \mathbf{K}, \mathbf{K}\right\} = d(...)$
the other variant of the local antibracket (def. ) of $\mathbf{K}$ with itself is a total spacetime derivative:

$\left\{ \mathbf{K}, \mathbf{K}\right\}' = d(...)$

This condition is also called the local classical master equation.

$\,$

derived critical locus on jet bundle – the local BV-BRST complex

With the local version of the antibracket in hand (def. ) it is now straightforward to refine the construction of a derived critical locus inside an action Lie algebroid (example ) to the “derived” shell (51) inside the formal dual of the local BRST complex (example ). The result is a derived Lie algebroid whose Chevalley-Eilenberg algebra is called the local BV-BRST complex. This is example below.

The following definition is the local refinement of def. :

Definition

(local infinitesimal cotangent Lie algebroid)

E / ( \mathcal{G} \times_\Sigma T \Sigma ) \;=\; \left( J^\infty_\Sigma( E \times_\Sigma (\mathcal{G}[1]) ) , s_{BRST} ) \right)

whose Chevalley-Eilenberg algebra is the local BRST complex of the field theory.

Consider the case that both the field bundle $E \overset{fb}{\to} \Sigma$ (def. ) as well as the gauge parameter bundle $\mathcal{G} \overset{gb}{\to} \Sigma$ are trivial vector bundles (example ) over Minkowski spacetime $\Sigma$ (def. ) with field coordinates $(\phi^a)$ and gauge parameter coordinates $(c^\alpha)$ .

Then the vertical infinitesimal cotangent Lie algebroid (def. ) has coordinates as in (180) as well as all the corresponding jets and including also the horizontal differentials:

\array{ & \left( c^\ddagger_{\alpha,\mu_1 \cdots \mu_k} \right) & \left( \phi^a_{,\mu_1 \cdots \mu_k} \right) , \left( \phi^\ddagger_{a,\mu_1 \cdots \mu_k} \right) & \left( c^\alpha_{,\mu_1 \cdots \mu_k} \right), \left( d x^\mu \right) \\ deg = & -1 & 0 & +1 } \,.

In terms of these coordinates BRST differential $s_{BRST}$ , thought of as a prolonged evolutionary vector field on $E \times_\Sigma \mathcal{G}$ , corresponds to the smooth function on the shifted cotangent bundle given by

(192)

L_{BRST} \;=\; \left( \underset{k \in \mathbb{N}}{\sum} c^\alpha_{,\mu_1 \cdots \mu_k} R_\alpha^{a \mu_1 \cdots \mu_k} \right) \phi^\ddagger_a \;+\; \tfrac{1}{2} \gamma^\alpha{}_{\beta \gamma} c^\beta c^\gamma c^\ddagger_\alpha \;\in\; C^\infty\left( T^\ast_{\Sigma,inf}( E \times_\Sigma \mathcal{G}[1] ) \right) \,,

to be called the BRST Lagrangian function and the product with the spacetime volume form

L_{BRST} \, dvol_\Sigma \;\in\; \Omega^{p+1,0}_\Sigma(E \times_\Sigma \mathcal{G}[1])

as the BRST Lagrangian density.

We now define the Chevalley-Eilenberg differential on smooth functions on $T^\ast_{inf}( E/(\mathcal{G} \times_\Sigma T \Sigma) )$ to be given by the local antibracket $\{-,-\}$ (190) with the BRST Lagrangian density (192)

d_{CE(T^\ast_{\Sigma,inf}( E/(\mathcal{G} \times_\Sigma T \Sigma) ))} \;\coloneqq\; \left\{ L_{BRST} dvol_\Sigma, - \right\}

This defines an $L_\infty$ -algebroid to be denoted

T^\ast_{\Sigma,inf}( E/(\mathcal{G} \times_\Sigma T \Sigma) ) \,.

The local refinement of prop. is now this:

Proposition

(Euler-Lagrange form is section of local cotangent bundle of jet bundle gauge-action Lie algebroid)

Let $(E,\mathbf{L})$ be a Lagrangian field theory (def. ) over some spacetime $\Sigma$ , and let $\mathcal{G} \overset{gb}{\to} \Sigma$ be a gauge parameter bundle (def. ) which are closed (def. ), inducing via example the Lie algebroid $E / ( \mathcal{G} \times_\Sigma T \Sigma )$ and via def. its local cotangent Lie ∞-algebroid $T^\ast_{inf}_\Sigma(E / ( \mathcal{G} \times_\Sigma T \Sigma ))$ .

Then the Euler-Lagrange variational derivative (prop. ) constitutes a section of the local cotangent Lie ∞-algebroid (def. )

\array{ && T^\ast_{\Sigma,inf}\left( E/(\mathcal{G} \times_\Sigma T \Sigma) \right) \\ & {}^{\mathllap{ \delta_{EL} \mathbf{L} }}\nearrow & \downarrow^{\mathrlap{cb}} \\ E/(\mathcal{G} \times_\Sigma T \Sigma) &=& E/(\mathcal{G} \times_\Sigma T \Sigma) }

given dually

CE(E/(\mathcal{G} \times_\Sigma T\Sigma)) \overset{(\delta_{EL}\mathbf{L})^\ast}{\longleftarrow} CE(T^\ast_{inf}(E/(\mathcal{G}\times_\Sigma T \Sigma)))

\array{ \left\{ \phi^a_{,\mu_1 \cdots \mu_k} \right\} &\longleftarrow& \left\{ \phi^a_{,\mu_1 \cdots \mu_k} \right\} \\ \left\{ c^\alpha_{,\mu_1 \cdots \mu_k} \right\} &\longleftarrow& \left\{ c^\alpha_{,\mu_1 \cdots \mu_k} \right\} \\ \left\{ \frac{d^k}{ d x^{\mu_1} \cdots d x^{\mu_k}} \left( \frac{\delta_{EL} L}{\delta \phi^a} \right) \right\} &\longleftarrow& \left\{ \phi^\ddagger_{a,\mu_1 \cdots \mu_k} \right\} \\ \left\{ 0 \right\} &\longleftarrow& \left\{ c^\ddagger_{\alpha,\mu_1 \cdots \mu_k} \right\} }

Proof

The proof of this proposition is a special case of the observation that the differentials involved are part of the local BV-BRST differential; this will be a direct consequence of the proof of prop. below.

The local analog of def. is now the following definition of the “derived prolonged shell” of the theory (recall the ordinary prolonged shell $\mathcal{E}^\infty \hookrightarrow J^\infty_\Sigma(E)$ from (52)):

Definition

(derived reduced prolonged shell)

Let $(E,\mathbf{L})$ be a Lagrangian field theory (def. ) over some spacetime $\Sigma$ , and let $\mathcal{G} \overset{gb}{\to} \Sigma$ be a bundle of closed irreducible gauge parameters (def. ), inducing via prop. a section $\delta_{EL} L$ of the local cotangent Lie algebroid of the jet bundle gauge-action Lie algebroid.

Then the derived prolonged shell $(E/( \mathcal{G} \times_\Sigma T \Sigma ))_{\delta_{EL}L \simeq 0}$ is the derived critical locus of $\delta_{EL} L$ , hence the homotopy pullback of $\delta_{EL} L$ along the zero section of the local cotangent Lie $\infty$ -algebroid:

\array{ (E/( \mathcal{G} \times_\Sigma T \Sigma ))_{\delta_{EL}L \simeq 0} &\longrightarrow& E/( \mathcal{G} \times_\Sigma T \Sigma ) \\ \downarrow &(pb)& \downarrow^{\mathrlap{0}} \\ E/(\mathcal{G} \times_\Sigma T \Sigma) &\underset{\delta_{EL} L}{\longrightarrow}& T^\ast_{\Sigma,inf} \left( E/( \mathcal{G} \times_\Sigma T \Sigma ) \right) }

As before, for the purpose of our running examples the reader may take the following example as the definition of the derived reduced prolonged shell (def. ). This is local refinement of example :

Example

(local BV-BRST complex)

Let $(E,\mathbf{L})$ be a Lagrangian field theory (def. ) over Minkowski spacetime $\Sigma$ , and let $\mathcal{G} \overset{gb}{\to} \Sigma$ be a gauge parameter bundle (def. ) which is closed (def. ). Assume that both are trivial vector bundles (example ) with field coordinates as in prop. .

Then the Chevalley-Eilenberg algebra of the derived prolonged shell $(E/( \mathcal{G} \times_\Sigma T \Sigma ))_{\delta_{EL}L \simeq 0}$ (def. ) is

CE\left( (E/( \mathcal{G} \times_\Sigma T \Sigma ))_{\delta_{EL}L \simeq 0} \right) \;=\; \left( C^\infty\left( T^\ast_{\Sigma,inf}( E \times_\Sigma \mathcal{G}[1] \times_\Sigma T^\ast \Sigma[1] ) \right) \,,\, \underset{ = s }{ \underbrace{ \left\{ \left(- L + L_{BRST}\right) dvol_\Sigma \,, (-) \right\} } } \;+\; d \right)

where the underlying graded algebra is the algebra of functions on the (-1)-shifted vertical cotangent bundle of the fiber product of the field bundle with the (+1)-shifted gauge parameter bundle (as in example ) and the shifted cotangent bundle of $\Sigma$ , and where the Chevalley-Eilenberg differential is the sum of the horizontal derivative $d$ with the BV-BRST differential

(193)

s \;\coloneqq\; \left\{ \left(- L + L_{BRST}\right) dvol_\Sigma \,, (-) \right\}

which is the local antibracket (def. ) with the BV-BRST Lagrangian density

\left( -L + L_{BRST}\right) \;\in\; \Omega^{p+1,0}_\Sigma\left( T^\ast_{\Sigma,inf}[-1]\left( E \times_\Sigma \mathcal{G}[1] \right)\right)

which itself is the sum of (minus) the given Lagrangian density (def. ) with the BRST Lagrangian (192).

The action of the BV-BRST differential on the generators is as follows:

\array{ & & \array{ \text{BV-BRST differential} \\ s } & \\ \text{field} & \phi^a &\mapsto& \underset{ = s_{BRST}(\phi^a) }{ \underbrace{ \left( \underset{k \in \mathbb{N}}{\sum} c^\alpha_{,\mu_1 \cdots \mu_k} R_\alpha^{a \mu_1 \cdots \mu_k} \right) } } & \text{gauge symmetry} \\ \text{ ghost field } & c^\alpha &\mapsto& \underset{ = s_{BRST}(c^\alpha) }{ \underbrace{ \tfrac{1}{2}\gamma^\alpha{}_{\beta \gamma} c^\beta c^\gamma } } & \text{Lie bracket} \\ \text{antifield} & \phi^\ddagger_\alpha &\mapsto& \phantom{-} \underset{ = s_{BV}(\phi^\ddagger_a) }{ \underbrace{ \frac{\delta_{EL} L}{\delta \phi^a} }} & \text{equations of motion} \\ &&& \underset{ = s_{BRST}(\phi^\ddagger_a) }{ \underbrace{ - \left( \underset{k \in \mathbb{N}}{\sum} \frac{\delta_{EL}}{\delta \phi^a} \left( c^\alpha_{,\mu_1 \cdots \mu_k} R_\alpha^{b \mu_1 \cdots \mu_k} \phi^\ddagger_b \right) \right) } } & \\ \array{ \text{antifield of} \\ \text{ghost field} } & c^\ddagger_\alpha &\mapsto& \underset{ = s_{BV}(c^\ddagger_\alpha) }{ \underbrace{ - \underset{k \in \mathbb{N}}{\sum} (-1)^k \frac{d^k}{d x^{\mu_1} \cdots d x^{\mu_k}} \left( R_\alpha^{a \mu_1 \cdots \mu_k} \phi^\ddagger_a \right) } } & \text{Noether identities} \\ &&& + \underset{ = s_{BRST}(c^\ddagger_\alpha) }{ \underbrace{ \gamma^{\alpha'}{}_{ \alpha \beta} c^\beta c^\ddagger_{\alpha'} } } }

and this extends to jets of generator by $s \circ d + d \circ s = 0$ .

This is called the local BV-BRST complex.

By introducing a bigrading as in prop.

\array{ & \left( c^\ddagger_{\alpha, \mu_1 \cdots \mu_k} \right) & \left( \phi^\ddagger_{a, \mu_1 \cdots \mu_k} \right) & \left( \phi^a_{,\mu_1 \cdots \mu_k} \right) & \left( c^\alpha_{,\mu_1 \cdots \mu_k} \right) \\ deg_{gh} = & 0 & 0 & 0 & +1 \\ deg_{af} = & -2 & -1 & 0 & 0 }

this splits into the total complex of a bicomplex with

s \;=\; s_{BV} + s_{BRST}

with

\array{ & s_{BRST} & s_{BV} \\ deg_{gh} = & +1 & 0 \\ deg_{af} = & 0 & +1 }

as shown in the above table. Under this decomposition, the classical master equation

s^2 = 0 \phantom{AAAA} \Leftrightarrow \phantom{AAAA} \left\{ \left( -L + L_{BRST}\right) dvol_\Sigma \,,\, \left( -L + L_{BRST}\right) dvol_\Sigma \right\} = 0

is equivalent to three conditions:

\array{ \left( s_{BV} \right)^2 = 0 && \text{Noether's second theorem} \\ \left( s_{BRST} \right)^2 = 0 && \text{closure of gauge symmetry} \\ \left[ s_{BV}, s_{BRST} \right] = 0 && \left\{ \array{ \text{ gauge symmetry preserves the shell }, \\ \text{ gauge symmetry acts on Noether identities } } \right. }

(e.q. Barnich 10 (4.10))

Proof

Due to the construction in def. the BRST differential by itself is already assumed to square to the

\left(s_{BRST}\right)^2 = 0

The remaining conditions we may check on 0-jet generators.

The condition

\left( s_{BV} \right)^2 = 0

is non-trivial only on the antifields of the ghost fields. Here we obtain

\begin{aligned} s_{BV} s_{BV} c^\ddagger_\alpha & = -\underset{k \in \mathbb{N}}{\sum} (-1)^k \frac{d^k}{d x^{\mu_1} \cdots d x^{\mu_k}} \left( R_\alpha^{a \mu_1 \cdots \mu_k} \phi^\ddagger_a \right) \\ & = -\underset{k \in \mathbb{N}}{\sum} (-1)^k \frac{d^k}{d x^{\mu_1} \cdots d x^{\mu_k}} \left( R_\alpha^{a \mu_1 \cdots \mu_k} \frac{\delta_{EL} L}{\delta \phi^a} \right) \end{aligned}

That this vanishes is the statement of Noether's second theorem (prop. ).

Next we check

s_{BV} \circ s_{BRST} + s_{BRST} \circ s_{BV} = 0

on generators. On the fields $\phi^a$ and the ghost fields $c^\alpha$ this is trivial (both summands vanish separately). On the antifields we get on the one hand

\begin{aligned} s_{BRST} s_{BV} \phi^{\ddagger}_a & = s_{BRST} \frac{\delta_{EL} L}{\delta \phi^a} \\ & = \underset{k}{\sum} \underset{q}{\sum} \frac{d^q}{d x^{\nu_1} \cdots d x^{\nu_q}} \left( c^\alpha_{,\mu_1 \cdots \mu_k} R_\alpha^{b \mu_1 \cdots \mu_k} \right) \frac{\partial}{\partial \phi^b_{,\nu_1 \cdots \nu_q}} \frac{\delta_{EL} L}{\delta \phi^a} \end{aligned}

and on the other hand

\begin{aligned} s_{BV} s_{BRST} \phi^\ddagger_a & = - s_{BV} \underset{k}{\sum} \frac{\delta_{EL}}{\delta \phi^a} \left( c^\alpha_{,\mu_1 \cdots \mu_k} R_\alpha^{b \mu_1 \cdots \mu_k} \phi^\ddagger_b \right) \\ & = + \underset{k}{\sum} \underset{q}{\sum} (-1)^q \frac{d^q}{d x^{\nu_1} \cdots d x^{\nu_q}} \left( \frac{\partial}{\partial \phi^a_{,\mu_1 \cdots \mu_q}} \left( c^\alpha_{,\mu_1 \cdots \mu_k} R_\alpha^{b \mu_1 \cdots \mu_k} \right) \frac{\delta_{EL} L}{\delta \phi^b} \right) \end{aligned}

That the sum of these two terms indeed vanishes is equation (82) in the proof of the on-shell invariance of the equations of motion under infinitesimal symmetries of the Lagrangian (prop. )

Finally, on antifields of ghostfields we get

\begin{aligned} s_{BV} s_{BRST} c^\ddagger_\alpha & = s_{BV} \gamma^{\alpha'}{}_{\alpha \beta} c^\beta c^\ddagger_{\alpha'} \\ & = - \gamma^{\alpha'}{}_{\alpha \beta} c^\beta \underset{k \in \mathbb{N}}{\sum} (-1)^k \frac{d^k}{d x^{\mu_1} \cdots d x^{\mu_k}} \left( R_{\alpha'}^{a \mu_1 \cdots \mu_k} \phi^\ddagger_a \right) \end{aligned}

as well as

\begin{aligned} s_{BRST} s_{BV} c^\ddagger_\alpha & = s_{BRST} \left( \underset{k \in \mathbb{N}}{\sum} (-1)^k \frac{d^k}{d x^{\mu_1} \cdots d x^{\mu_k}} \left( R_\alpha^{a \mu_1 \cdots \mu_k} \phi^\ddagger_a \right) \right) \\ & = R \left( \underset{k \in \mathbb{N}}{\sum} (-1)^k \frac{d^k}{d x^{\mu_1} \cdots d x^{\mu_k}} \left( R_\alpha^{a \mu_1 \cdots \mu_k} \phi^\ddagger_a \right) \right) \;-\; \left( \underset{k \in \mathbb{N}}{\sum} (-1)^k \frac{d^k}{d x^{\mu_1} \cdots d x^{\mu_k}} \left( R_\alpha^{a \mu_1 \cdots \mu_k} \left( \underset{q \in \mathbb{N}}{\sum} \frac{\delta_{EL}}{\delta \phi^a} \left( c^{\alpha'}_{,\nu_1 \cdots \nu_q} R_{\alpha'}^{b \nu_1 \cdots \nu_q} \phi^\ddagger_b \right) \right) \right) \right) \\ & + R \left( \underset{k \in \mathbb{N}}{\sum} (-1)^k \frac{d^k}{d x^{\mu_1} \cdots d x^{\mu_k}} \left( R_\alpha^{a \mu_1 \cdots \mu_k} \phi^\ddagger_a \right) \right) \;-\; \left( \underset{k \in \mathbb{N}}{\sum} (-1)^k \frac{d^k}{d x^{\mu_1} \cdots d x^{\mu_k}} \left( R_\alpha^{a \mu_1 \cdots \mu_k} \left( \underset{q,r \in \mathbb{N}}{\sum} (-1)^{r} \frac{d^r}{d x^{\rho_1} \cdots d x^{\rho_r}} \left( c^{\alpha'}_{,\nu_1 \cdots \nu_q} \frac{\partial R_{\alpha'}^{b \nu_1 \cdots \nu_q}}{\partial \phi^a_{,\rho_1 \cdots \rho_r}} \phi^\ddagger_b \right) \right) \right) \right) \\ & = (R \cdot N_R)_a^b (\phi^\ddagger_b) \end{aligned}

where in the last line we identified the Lie algebra action of infinitesimal symmetries of the Lagrangian on Noether operators from def. . Under this identification, the fact that

\left( s_{BRST}s_{BV} + s_{BV} s_{BRST} \right) c^\ddagger_\alpha = 0

is relation (162) in prop. .

Example

(derived prolonged shell in the absence of explicit gauge symmetry – the local BV-complex)

Let $(E,\mathbf{L})$ be a Lagrangian field theory (def. ) with vanishing gauge parameter bundle (def. ) (possibly because there are no non-trivial infinitesimal gauge symmetries, such as for the scalar field, or because none were chose), hence with no ghost fields introduced. Then the local derived critical locus of its Lagrangian density (def. ) is the plain local BV-complex of def. .

s = s_{BV} \,.

Example

(local BV-BRST complex of vacuum electromagnetism on Minkowski spacetime)

Consider the Lagrangian field theory of free electromagnetism on Minkowski spacetime (example ) with gauge parameter as in example . With the field and gauge parameter coordinates as chosen in these examples

\left( (a_\mu), c \right)

then the local BV-BRST complex (prop. ) has generators

\array{ & c^\ddagger & (a^\ddagger)^\mu & a_\mu & c \\ deg = & -2 & -1 & 0 & 1 }

together with their total spacetime derivatives, and the local BV-BRST differential $s$ acts on these generators as follows:

s \;\colon\; \left\{ \array{ (a^\dagger)^\mu &\mapsto& f^{\nu \mu}_{,\nu} & \text{(equations of Motion -- vacuum Maxwell equations)} \\ c^\ddagger &\mapsto& (a^\ddagger)^\mu_{,\mu} & \text{(Noether identity)} \\ a_\mu &\mapsto& c_{,\mu} & \text{(infinitesimal gauge transformation)} } \right.

More generally:

Example

(local BV-BRST complex of Yang-Mills theory)

For $\mathfrak{g}$ a semisimple Lie algebra, consider $\mathfrak{g}$ -Yang-Mills theory on Minkowski spacetime from example , with local BRST complex as in example , hence with BRST Lagrangian (192) given by

L_{BRST} = \left( c^\alpha_{,\mu} - \gamma^\alpha{}_{\beta \gamma}c^\beta a^\gamma_\mu \right) (a^\ddagger)_\alpha^\mu \;+\; \tfrac{1}{2} \gamma^\alpha{}_{\beta \gamma} c^\beta c^\gamma c^\ddagger_\alpha \,.

Then its local BV-BRST complex (example ) has BV-BRST differential $s = \left\{ -L + L_{BRST} \,,\, - \right\}$ given on 0-jets as follows:

\array{ & & s & \\ \text{field} & a_\mu^\alpha &\mapsto& c^\alpha_{,\mu} - \gamma^\alpha{}_{\beta \gamma}c^\beta a^\gamma_\mu & \text{gauge symmetry} \\ \text{ ghost field } & c^\alpha &\mapsto& \tfrac{1}{2}\gamma^\alpha{}_{\beta \gamma} c^\beta c^\gamma & \text{Lie bracket} \\ \text{antifield} & (a^\ddagger)^\mu_\alpha &\mapsto& \phantom{-} \left( \frac{d}{d x^\mu} f^{\mu \nu \alpha'} + \gamma^{\alpha'}{}_{\beta' \gamma} a_\mu^{\beta'} f^{\mu \nu \gamma} \right) k_{\alpha' \alpha} & \text{equations of motion} \\ &&& - \gamma^{\alpha'}{}_{\beta \alpha}c^\beta (a^\ddagger)_{\alpha'}^\mu & \\ \text{anti ghostfield} & c^\ddagger_\alpha &\mapsto& \gamma^{\alpha'}{}_{\alpha \gamma} a^\gamma_\mu (a^\ddagger)^\mu_{\alpha'} + \frac{d}{d x^\mu} (a^\ddagger)^\mu_\alpha & \text{Noether identities} \\ &&& + \gamma^{\alpha'}{}_{ \alpha \beta} c^\beta c^\ddagger_{\alpha'} }

(e.g. Barnich-Brandt-Henneaux 00 (2.8))

$\,$

So far the discussion yields just the algebra of functions on the derived reduced prolonged shell. We now discuss the derived analog of the full variational bicomplex (def. ) to the derived reduced shell.

$\,$

(derived variational bicomplex)

The analog of the de Rham complex of a derived Lie algebroid is called the Weil algebra:

Definition

(Weil algebra of a Lie algebroid)

Given a derived Lie algebroid $\mathfrak{a}$ over some $X$ (def. ), its Weil algebra is

W(\mathfrak{a}) \;\coloneqq\; \left( Sym_{C^\infty(X)}( \Gamma(T^\ast_{inf} X) \oplus \mathfrak{a}_\bullet \oplus \mathfrak{a}[1]_\bullet ) \;,\; \mathbf{d}_W \coloneqq \mathbf{d} + d_{CE} \right) \,,

where $\mathbf{d}$ acts as the de Rham differential $\mathbf{d} \colon C^\infty(X) \to \Gamma(T^\ast_{inf} X)$ on functions, and as the degree shift operator $\mathbf{d} \colon \mathfrak{a}_\bullet \to \mathfrak{a}[1]_\bullet$ on the graded elements.

smooth manifolds	derived Lie algebroids
algebra of functions	Chevalley-Eilenberg algebra
algebra of differential forms	Weil algebra

Example

(classical Weil algebra)

Let $\mathfrak{g}$ be a Lie algebra with corresponding Lie algebroid $B \mathfrak{g}$ (example ). Then the Weil algebra (def. ) of $B \mathfrak{g}$ is the traditional Weil algebra of $\mathfrak{g}$ from classical Lie theory.

Definition

(variational BV-bicomplex)

Let $(E,\mathbf{L})$ be a Lagrangian field theory (def. ) equipped with a gauge parameter bundle $\mathcal{G}$ (def. ) which is closed (def. ). Consider the Lie algebroid $E/(\mathcal{G} \times_\Sigma T \Sigma)$ from example , whose Chevalley-Eilenberg algebra is the local BRST complex of the theory.

Then its Weil algebra $W(E/(\mathcal{G} \times_\Sigma T \Sigma))$ (def. ) has as differential the variational derivative (def. ) plus the BRST differential

\begin{aligned} d_{W} & = \mathbf{d} - (d - s_{BRST}) \\ & = \delta + s_{BRST} \end{aligned} \,.

Therefore we speak of the variational BRST-bicomplex and write

\Omega^\bullet_\Sigma( E/(\mathcal{G} \times_\Sigma T \Sigma) ) \,.

Similarly, the Weil algebra of the derived prolonged shell $(E/( \mathcal{G} \times_\Sigma T \Sigma ))_{\delta_{EL}L \simeq 0}$ (def. ) has differential

\begin{aligned} d_W & = \mathbf{d} - (d - s) \\ & = \delta + s \end{aligned} \,.

Since $s$ is the BV-BRST differential (prop. ) this defines the “BV-BRST variational bicomplex”.

$\,$

global BV-BRST complex

Finally we may apply transgression of variational differential forms to turn the local BV-BRST complex on smooth functions on the jet bundle into a global BV-BRST complex on graded local observables on the graded space of field histories.

Definition

(global BV-BRST complex)

Let $(E,\mathbf{L})$ be a Lagrangian field theory (def. ) equipped with a gauge parameter bundle $\mathcal{G}$ (def. ) which is closed (def. ). Then on the local observables (def. ) on the space of field histories (def. ) of the graded field bundle

E_{\text{BV-BRST}} = T^\ast_{\Sigma,inf}[-1](E \times_\Sigma \mathcal{G}[1])

underlying the local BV-BRST complex (189), consider the linear map

(194)

\array{ LocObs(E_{\text{BV-BRST}}) \otimes LocObs(E_{\text{BV-BRST}}) &\overset{\{-,-\}}{\longrightarrow}& LocObs(E_{\text{BV-BRST}}) \\ \tau_\Sigma(\alpha), \tau_\Sigma(\beta) &\mapsto& \tau_\Sigma( \{\alpha, \beta\} ) }

where $\alpha, \beta \in \Omega^{p+1,0}_{\Sigma,cp}(E_{\text{BV-BRST}})$ (def. ), where $\tau_\Sigma$ denotes transgression of variational differential forms (def. ), and where on the right $\{-,-\}$ is the local antibracket (def. ).

This is well-defined, in that this formula indeed depends on the horizontal differential forms $\alpha$ and $\beta$ only through the local observables $\tau_\Sigma(\alpha), \tau_\Sigma(\beta)$ which they induce. The resulting bracket is called the (global) antibracket.

Indeed the formula makes sense already if at least one of $\alpha, \beta$ have compact spacetime support (def. ), and hence the transgression of the BV-BRST differential (193) is a well-defined differential on the graded local observables

\left\{ -\tau_\Sigma \mathbf{L} + \tau_\Sigma \mathbf{L}_{BRST} \;,\, - \right\} \;\colon\; LocObs(E_{\text{BV-BRST}}) \longrightarrow LocObs(E_{\text{BV-BRST}}) \,,

where by example we may think of the first argument on the left as the BV-BRST action functional without adiabatic switching, which makes sense inside the antibracket when acting on functionals with compact spacetime support. Hence we may suggestively write

(195)

\left\{ -S + S_{BRST} \;,\;- \right\} \;\coloneqq\; \left\{ -\tau_\Sigma \mathbf{L} + \tau_\Sigma \mathbf{L}_{BRST} \;,\, - \right\}

for this (global) BV-BRST differential.

This uniquely extends as a graded derivation to multilocal observables (def. ) and from there along the dense subspace inclusion (107)

PolyMultiLocObs(E_{\text{BV-BRST}}) \overset{\text{dense}}{\hookrightarrow} PolyObs(E_{\text{BV-BRST}})

to a differential on off-shell polynomial observables (def. ):

\{-S' + S'_{BRST}\} \;\colon\; PolyObs(E_{\text{BV-BRST}}) \longrightarrow PolyObs(E_{\text{BV-BRST}})

This differential graded-commutative superalgebra

(196)

\left( \left( \underset{ \text{vector space} }{ \underbrace{ PolyObs(E_{\text{BV-BRST}}) }} , \underset{ \text{product} }{ \underbrace{ (-)\cdot(-) }} \right) , \underset{ \text{differential} }{ \underbrace{ \{-S' + S'_{BRST}, -\} }} \right)

is the global BV-BRST complex of the given Lagrangian field theory with the chosen gauge parameters.

Proof

We need to check that the global antibracket (194) is well defined:

By the last item of prop. the horizontally exact horizontal differential forms form a “Lie ideal” for the local antibracket. With this the proof that the transgressed bracket is well defined is the same as the proof that the global Poisson bracket on the Hamiltonian local observables is well defined, def. .

Example

(global BV-differential in components)

In the situation of def. , assume that the field bundles of all fields, ghost fields and auxiliary fields are trivial vector bundles, with field/ghost-field/auxiliary-field coordinates on their fiber product bundle collectively denoted $(\phi^A)$ .

Then the first summand of the global BV-BRST differential (def. ) is given by

(197)

\begin{aligned} \left\{ -S', -\right\} & = \int_\Sigma j^{\infty}\left(\mathbf{\Phi}\right)^\ast \left( \frac{\overset{\leftarrow}{\delta}_{EL} L}{\delta \phi^A} \right)(x) \frac{\delta}{\delta \mathbf{\Phi}^\ddagger_A(x)} \, dvol_\Sigma(x) \\ & = \underset{A}{\sum} (-1)^{deg(\phi^A)} \int_\Sigma (P_{A B}\mathbf{\Phi}^A)(x) \frac{\delta}{\delta \mathbf{\Phi}^\ddagger_A(x)} \, dvol_\Sigma(x) \end{aligned}

where

$P \;\colon\; \Gamma_\Sigma(E) \longrightarrow \Gamma_\Sigma(E^\ast)$

is the differential operator (66) from def. , corresponding to the Euler-Lagrange equations of motion.
$deg(\phi^A) \coloneqq n_{(\Phi^A)} + \sigma_{\Phi^A} \;\in\; \mathbb{Z}/2$

is the sum of the cohomological degree and of the super-degree of $\Phi^A$ (as in def. , def. ).

It follows that the cochain cohomology of the global BV-differential $\{-S',-\}$ (196) in $deg_{af} = 0$ is the space of on-shell polynomial observables:

(198)

\underset{ \text{off-shell} }{ \underbrace{ PolyObs(E_{\text{BV-BRST}})_{def(af = 0)} }}/im(\{-S',-\}) \;\simeq\; \underset{ \text{on-shell} }{ \underbrace{ PolyObs(E_{\text{BV-BRST}}, \mathbf{L}') }} \,.

Proof

By definition, the part $\mathbf{L}'$ of the gauge fields Lagrangian density is independent of antifields, so that the local antibracket with $\mathbf{L}'$ reduces to

\left\{ -\mathbf{L}',-\right\} \;=\; \frac{\overset{\leftarrow}{\delta}_{EL} \mathbf{L}'}{\delta \phi^A} \frac{\delta}{\delta \phi^{\ddagger}_A}

With this the expression for $\{-S',-\}$ follows directly from the definition of the global antibracket (def. ) and the Euler-Lagrange equations (66)

(P \Phi)_A = j^\infty_\Sigma(\Phi)\left( \frac{\delta_{EL} L}{\delta \phi^A} \right) \,.

where the sign $(-1)^{deg(\phi^A)}$ is the relative sign between $\frac{\delta_{EL} L}{\delta \phi^A} = \frac{\overset{\rightarrow}{\delta}_{EL} L'}{\delta \phi^A}$ and $\frac{\overset{\leftarrow}{\delta}_{EL} L'}{\delta \phi^A}$ (def. ):

By the assumption that $L'$ defines a free field theory, $\mathbf{L}'$ is quadratic in the fields, so that from $deg(\mathbf{L}) = 0$ it follows that the derivations from the left and from the right differ by the relative sign

\begin{aligned} (-1)^{ \left( n_{(\phi^A)} n_{(\phi^A)} + \sigma_{(\phi^A)} \sigma_{(\phi^A)} \right) } & = (-1)^{ \left( n_{(\phi^A)} + \sigma_{(\phi^A)} \right) } \\ & = (-1)^{deg(\phi^A)} \end{aligned} \,.

From this the identification (198) follows by (102) in theorem .

$\,$

This concludes our discussion of the reduced phase space of a Lagrangian field theory exhibited, dually by its local BV-BRST complex. In the next chapter we finally turn to the key implication of this construction: the gauge fixing of a Lagrangian gauge theory which makes the collection of fields and auxiliary fields (ghost fields and antifields) jointly have a (differential-graded) covariant phase space.

Gauge fixing

In this chapter we discuss the following topics:

Quasi-isomorphisms between local BV-BRST complexes
1. gauge fixing chain maps;
2. adjoining contractible complexes of auxiliary fields
Example: gauge fixed electromagnetic field

$\,$

While in the previous chapter we had constructed the reduced phase space of a Lagrangian field theory, embodied by the local BV-BRST complex (example ), as the homotopy quotient by the infinitesimal gauge symmetries of the homotopy intersection with the shell, this in general still does not yield a covariant phase space of on-shell field histories (prop. ), since Cauchy surfaces for the equations of motion may still not exist (def. ).

However, with the homological resolution constituted by the BV-BRST complex in hand, we now have the freedom to adjust the field-content of the theory without changing its would-be reduced phase space, namely without changing its BV-BRST cohomology. In particular we may adjoin further “auxiliary fields” in various degrees, as long as they contribute only a contractible cochain complex to the BV-BRST complex. If such a quasi-isomorphism of BV-BRST complexes brings the Lagrangian field theory into a form such that the equations of motion of the combined fields, ghost fields and potential further auxiliary fields are Green hyperbolic differential equations after all, and thus admit a covariant phase space, then this is called a gauge fixing (def. below), since it is the infinitesimal gauge symmetries which obstruct the existence of Cauchy surfaces (by prop. and remark ).

The archetypical example is the Gaussian-averaged Lorenz gauge fixing of the electromagnetic field (example below) which reveals that the gauge-invariant content of electromagnetic waves is only in their transversal wave polarization (prop. below).

The tool of gauge fixing via quasi-isomorphisms of BV-BRST complexes finally brings us in position to consider, in the following chapters, the quantization also of gauge theories: We use gauge fixing quasi-isomorphisms to bring the BV-BRST complexes of the given Lagrangian field theories into a form that admits degreewise quantization of a graded covariant phase space of fields, ghost fields and possibly further auxiliary fields, compatible with the gauge-fixed BV-BRST differential:

$\,$

\array{ \underline{\mathbf{\text{pre-quantum geometry}}} && \underline{\mathbf{\text{higher pre-quantum geometry}}} \\ \, \\ \left\{ \array{ \text{Lagrangian field theory with} \\ \text{infinitesimal gauge transformations} } \right\} &\overset{ \text{homotopy quotient by} \atop \text{gauge transformations} }{\longrightarrow}& \left\{ \array{ \text{dg-Lagrangian field theory with} \\ \text{quotiented by gauge transformations} \\ \text{embodied by BRST complex } } \right\} \\ && \Big\downarrow{}^{\mathrlap{ \text{pass to} \atop \text{derived critical locus} }} \\ && \left\{ \array{ \text{dg-reduced phase space} \\ \text{ embodied by BV-BRST complex } } \right\} \\ && {}^{\mathllap{\simeq}}\Big\downarrow{}^{\mathrlap{\text{fix gauge} }} \\ \left\{ \array{ \text{ decategorified } \\ \text{ covariant } \\ \text{ reduced phase space } } \right\} &\underset{\text{pass to cohomology}}{\longleftarrow}& \left\{ \array{ \text{ dg-covariant} \\ \text{reduced phase space } } \right\} \\ && \Big\downarrow{}^{\mathrlap{ \array{ \text{ quantize } \\ \text{degreewise} } }} \\ \left\{ \array{ \text{gauge invariant} \\ \text{quantum observables} } \right\} &\underset{\text{pass to cohomology}}{\longleftarrow}& \left\{ \array{ \text{quantum} \\ \text{BV-BRST complex} } \right\} }

Here:

term	meaning
“phase space”	derived critical locus of Lagrangian equipped with Poisson bracket
“reduced”	gauge transformations have been homotopy-quotiented out
“covariant”	Cauchy surfaces exist degreewise

$\,$

quasi-isomorphisms between local BV-BRST complexes

Recall (prop. ) that given a local BV-BRST complex (example ) with BV-BRST differential $s$ , then the space of local observables which are on-shell and gauge invariant is the cochain cohomology of $s$ in degree zero:

H^0(s \vert d) \;=\; \left\{ \array{ \text{gauge invariant on-shell} \\ \text{local observables} } \right\}

The key point of having resolved (in chapter Reduced phase space) the naive quotient by infinitesimal gauge symmetries of the naive intersection with the shell by the L-infinity algebroid whose Chevalley-Eilenberg algebra is called the local BV-BRST complex, is that placing the reduced phase space into the context of homotopy theory/homological algebra this way provides the freedom of changing the choice of field bundle and of Lagrangian density without actually changing the Lagrangian field theory up to equivalence, namely without changing the cochain cohomology of the BV-BRST complex.

A homomorphism of differential graded-commutative superalgebras (such as BV-BRST complexes) which induces an isomorphism in cochain cohomology is called a quasi-isomorphism. We now discuss two classes of quasi-isomorphisms between BV-BRST complexes:

gauge fixing (def. below)
adjoining auxiliary fields (def. below).

$\,$

gauge fixing chain maps

Proposition

(local anti-Hamiltonian flow is automorphism of local antibracket)

Let

CE\left( E/(\mathcal{G} \times_\Sigma T\Sigma)_{\delta_{EL} L \simeq 0} \right) \;=\; \left( \Omega^{0,0}_\Sigma\left( T^\ast_{\Sigma,inf}[-1]\left(E \times_\Sigma \mathcal{G}[1]\right) \times_\Sigma T \Sigma[1] \right) \;,\; d_{CE} = \underset{s}{ \underbrace{ \left\{ -\mathbf{L} + \mathbf{L}_{BRST} \,,\, - \right\} } } \;+\; d \right)

be a local BV-BRST complex of a Lagrangian field theory $(E,\mathbf{L})$ (example ).

Then for

\mathbf{L}_{gf} \;\in\; \Omega^{p+1,0} \left( T^\ast_{\Sigma,inf}\left(E \times_\Sigma \mathcal{G}\right) \times_\Sigma T \Sigma[1] \right)

a Lagrangian density (def. ) on the graded field bundle

\mathbf{L}_{gf} \;=\; L_{gf} \ dvol_\Sigma

of degree

deg(L) = (-1, even)

then the exponential of forming the local antibracket (def. ) with $\mathbf{L}_{gf}$

\array{ \Omega^{p+1,0}_\Sigma\left( T^\ast_{\Sigma,inf}[-1]\left( E \times_\Sigma \mathcal{G}[1]\right) \right) & \overset{ e^{\left\{ \mathbf{L}_{gf} \,,\, -\right\}}(-) }{\longrightarrow} & \Omega^{p+1,0}_\Sigma\left( T^\ast_{\Sigma,inf}[-1]\left( E \times_\Sigma \mathcal{G}[1]\right) \right) \\ \mathbf{K} &\mapsto& \left\{ \mathbf{L}_{gf} , \mathbf{K} \right\} + \tfrac{1}{2} \left\{ \mathbf{L}_{gf} \,,\, \left\{ \mathbf{L}_{gf} \,,\, \mathbf{K} \right\} \right\} + \tfrac{1}{6} \left\{ \mathbf{L}_{gf} \,,\,\left\{ \mathbf{L}_{gf} \,,\, \left\{ \mathbf{L}_{gf} \,,\,\mathbf{K} \right\} \right\} \right\} + \cdots }

is an endomorphism of the local antibracket (def. ) in that

e^{ \left\{ \mathbf{\psi} \,,\, - \right\} } \left( \left\{ \mathbf{A} \,,\, \mathbf{B} \right\} \right) \;=\; \left\{ e^{ \left\{ \mathbf{\psi} \,,\, - \right\} } \left(\mathbf{A}\right) \,,\, e^{ \left\{ \psi \,,\, - \right\} } \left(\mathbf{B}\right) \right\}

and in fact an automorphism, with inverse morphism given by

\left(e^{\left\{ \psi \,,\, -\right\}}(-)\right)^{-1} \;=\; e^{\left\{ -\psi \,,\, -\right\}}(-) \,.

We may think of this as the Hamiltonian flow of $\mathbf{L}_{gf}$ under the local antibracket.

In particular when applied to the BV-Lagrangian density

s_{gf} \;\coloneqq\; \left\{ e^{\left\{ \mathbf{L}_{gf},-\right\}}\left(- \mathbf{L} + \mathbf{L}_{BRST}\right) \,,\, - \right\}

this yields another differential

\left( s_{gf}\right)^2 \;=\; 0

and hence another differential graded-commutative superalgebra (def. )

CE\left( E/(\mathcal{G} \times_\Sigma T\Sigma)^{gf}_{\delta_{EL} L \simeq 0} \right) \;=\; \left( \Omega^{0,0}_\Sigma\left( T^\ast_{\Sigma,inf}[-1]\left(E \times_\Sigma \mathcal{G}[1]\right) \times_\Sigma T \Sigma[1] \right) \;,\; d_{CE} = \underset{s_{gf}}{ \underbrace{ \left\{ e^{\left\{ \mathbf{L}_{gf}, - \right\}}\left( - \mathbf{L} + \mathbf{L}_{BRST} \right) \,,\, - \right\} } } \;+\; d \right)

Finally, $e^{\left\{\mathbf{L}_{gf},-\right\}}$ constitutes a chain map from the local BV-BRST complex to this deformed version, in fact a homomorphism of differential graded-commutative superalgebras, in that

s_{gf} \circ e^{ \left\{ \mathbf{L}_{gf}\,,\, - \right\} } \;=\; e^{ \left\{ \mathbf{L}_{gf}\,,\, - \right\} } \circ s \,.

Proof

By prop. the local antibracket $\left\{ -,-\right\}$ is a graded derivation in its second argument, of degree one more than the degree of its first argument (191). Hence for the first argument of degree -1 this implies that $e^{\{\mathbf{L}_{gf}, - \}}$ is an automorphism of the local antibracket. Moreover, it is clear from the definition that $\left\{ \mathbf{L}_{gf},-\right\}$ is a derivation with respect to the pointwise product of smooth functions, so that $e^{\{\mathbf{L}_{gf},-\}}$ is also a homomorpism of graded algebras.

Since $e^{\{\mathbf{L}_{gf}, -\}}$ is an automorphism of the local antibracket, and since $s$ and $s_{gf}$ are themselves given by applying the local antibracket in the second argument, this implies that $e^{\{\mathbf{L}_{gf},-\}}$ respects the differentials:

\array{ \mathbf{A} &\overset{e^{\{\mathbf{L}_{gf},-\}}}{\longrightarrow}& e^{\{\mathbf{L}_{gf},-\}}\left( \mathbf{A} \right) \\ {}^{\mathllap{s}}\downarrow && \downarrow^{\mathrlap{s_{gf}}} \\ \left\{ \left(-\mathbf{L} + \mathbf{L}_{BRST}\right)\,,\, \mathbf{A}\right\} &\underset{ e^{\{\mathbf{L}_{gf}\,,\,-\}} }{\longrightarrow}& \left\{ e^{\{\mathbf{L}_{gf},-\}}\left(-\mathbf{L} + \mathbf{L}_{BRST}\right) \,,\, e^{\{\mathbf{L}_{gf},-\}}(\mathbf{A}) \right\} }

Definition

(gauge fixing Lagrangian density)

Let

CE\left( E/(\mathcal{G} \times_\Sigma T\Sigma)_{\delta_{EL} L \simeq 0} \right) \;=\; \left( \Omega^{0,0}_\Sigma\left( T^\ast_{\Sigma,inf}[-1]\left(E \times_\Sigma \mathcal{G}[1]\right) \times_\Sigma T \Sigma[1] \right) \;,\; d_{CE} = \underset{s}{ \underbrace{ \left\{ -\mathbf{L} + \mathbf{L}_{BRST} \,,\, - \right\} } } \;+\; d \right)

be a local BV-BRST complex of a Lagrangian field theory $(E,\mathbf{L})$ (example ) and let

\mathbf{L}_{gf} \;\in\; \Omega^{p+1,0} \left( T^\ast_{\Sigma,inf}\left(E \times_\Sigma \mathcal{G}\right) \times_\Sigma T \Sigma[1] \right)

be a Lagrangian density (def. ) on the graded field bundle such that

deg(L_{gf}) = -1 \,.

If the quasi-isomorphism of BV-BRST complexes given by the local anti-Hamiltonian flow $\mathbf{L}_{gf}$ via prop.

e^{\left\{ \mathbf{L}_{gf},-\right\}} \;\colon\; CE\left( E/(\mathcal{G} \times_\Sigma T\Sigma)^{gf}_{\delta_{EL} L \simeq 0} \right) \overset{\phantom{A}\simeq_{qi}\phantom{A}}{\longrightarrow} CE\left( E/(\mathcal{G} \times_\Sigma T\Sigma)^{gf}_{\delta_{EL} L \simeq 0} \right)

is such that for the transformed graded Lagrangian field theory

(199)

-\underset{deg_{af} = 0}{\underbrace{\mathbf{L}' }} + \mathbf{L}'_{BRST} \;\coloneqq\; e^{\{\mathbf{L}_{gf},-\}}(-\mathbf{L} + \mathbf{L}_{BRST})

(with Lagrangian density $\mathbf{L}'$ the part independent of antifields) the Euler-Lagrange equations of motion (def. ) admit Cauchy surfaces (def. ), then we call $\mathbf{L}_{gf}$ a gauge fixing Lagrangian density for the original Lagrangian field theory, and $\mathbf{L}'$ the corresponding gauge fixed form of the original Lagrangian density $\mathbf{L}$ .

Remark

(warning on terminology)

What we call a gauge fixing Lagrangian density $\mathbf{L}_{gf}$ in def. is traditionally called a gauge fixing fermion and denoted by “ $\psi$ ” (Henneaux 90, section 8.3, 8.4).

Here “fermion” is meant as a reference to the fact that the cohomological degree $deg(L_{gf}) = -1$ , which is reminiscent of the odd super-degree of fermion fields such as the Dirac field (example ); see at signs in supergeometry the section The super odd sign rule.

Example

(gauge fixing via anti-Lagrangian subspaces)

Let $\mathbf{L}_{gf}$ be a gauge fixing Lagrangian density as in def. such that

its local antibracket-square vanishes

$\left\{ \mathbf{L}_{gf},\, \left\{ \mathbf{L}_{gf}, \, -\right\} \right\} = 0$

hence its anti-Hamiltonian flow has at most a linear component in its argument $\mathbf{A}$ :

$e^{\left\{ \mathbf{L}_{gf} \,,\, \mathbf{A} \right\}} \;=\; \mathbf{A} + \left\{ \mathbf{L}_{gf} \,,\, \mathbf{A} \right\}$
it is independent of the antifields

$deg_{af}\left( L_{gf} \right) \;=\; 0 \,.$

Then with

$(\phi^A)$ collectively denoting all the field coordinates

(including the actual fields $\phi^a$ , the ghost fields $c^\alpha$ as well as possibly further auxiliary fields)
$(\phi^\ddagger_A)$ collectively denoting all the antifield coordinates

(includion the antifields $\phi^\ddagger_a$ of the actual fields, the antifields $c^\ddagger_\alpha$ of the ghost fields as well as those of possibly further auxiliary fields )

we have

\begin{aligned} (\phi')^A & \coloneqq e^{\left\{ \mathbf{L}_{gf}\,,\, - \right\}}(\phi^A) \\ & = \phi^A \\ \phantom{A} \\ (\phi')^\ddagger_A & \coloneqq e^{\left\{ \mathbf{L}_{gf}\,,\, - \right\}} \left( \phi^\ddagger_A \right) \\ & = \phi^\ddagger_A - \frac{\overset{\leftarrow}{\delta}_{EL} \mathbf{L}_{gf}}{\delta \phi^a} \end{aligned}

(and similarly for the higher jets); and the corresponding transformed Lagrangian density (199) may be written as

\begin{aligned} -\mathbf{L}' + \mathbf{L}'_{BRST} & \coloneqq e^{\left\{ \mathbf{L}_{gf}\,,\, - \right\}}\left( -\mathbf{L} + \mathbf{L}_{BRST} \right) \\ & = \left( -\mathbf{L} + \mathbf{L}_{BRST} \right) \left( \phi', (\phi')^\ddagger \right) \end{aligned} \,,

where the notation on the right denotes that $\phi'$ is substituted for $\phi$ and $\phi'_\ddagger$ for $\phi_\ddagger$ .

This means that the defining condition that $\mathbf{L}'$ be the antifield-independent summand (199), which we may write as

\mathbf{L}' \coloneqq \left( -\mathbf{L} + \mathbf{L}_{BRST} \right) \left( \phi'(\phi), \phi_\ddagger = 0 \right)

translates into

\mathbf{L}' \coloneqq \left( -\mathbf{L} + \mathbf{L}_{BRST} \right) \left( \phi', (\phi')^\ddagger_A = -\frac{\overset{\leftarrow}{\delta}_{EL} L_{gf}}{\delta \phi^A} \right) \,.

In this form BV-gauge fixing is considered traditionally (e.g. Hennaux 90, section 8.3, page 83, equation (76b) and item (iii)).

$\,$

adjoining contractible cochain complexes of auxiliary fields

Typically a Lagrangian field theory $(E,\mathbf{L})$ for given choice of field bundle, even after finding appropriate gauge parameter bundles $\mathcal{G}$ , does not yet admit a gauge fixing Lagrangian density (def. ). But if the gauge parameter bundle has been chosen suitably, then the remaining obstruction vanishes “up to homotopy” in that a gauge fixing Lagrangian density does exist if only one adjoins sufficiently many auxiliary fields forming a contractible complex, hence without changing the cochain cohomology of the BV-BRST complex:

Definition

(auxiliary fields and antighost fields)

Over Minkowski spacetime $\Sigma$ , let

A \overset{aux}{\longrightarrow} \Sigma

be any graded vector bundle (remark ), to be regarded as a field bundle (def. ) for auxiliary fields. If this is a trivial vector bundle (example ) we denote its field coordinates by $(b^i)$ . On the corresponding graded bundle with degrees shifted down by one

A[-1] \overset{aux[-1]}{\longrightarrow} \Sigma

we write $(\overline{c}^i)$ for the induced field coordinates.

Accordingly, the shifted infinitesimal vertical cotangent bundle (def. ) of the fiber product of these bundles

T^\ast_{\Sigma,inf}[-1]\left( A \times_\Sigma A^\ast[-1] \right)

has the following coordinates:

\array{ \text{name:} & \array{ \text{antifield of} \\ \text{antighost field} } & \array{ \text{antifield of} \\ \text{auxiliary field} } & \text{antighost field} & \text{auxiliary field} \\ \text{symbol:} & \overline{c}^\ddagger_i & b^\ddagger_i & \overline c^i & b^i \\ deg = & -(deg(b^i)-1)-1 & -deg(b^i)-1 & deg(b^i)-1 & deg(b^i) \\ & = -deg(b^i) }

On this fiber bundle consider the Lagrangian density (def. )

(200)

\mathbf{L}_{aux} \;\in\; \Omega^{p+1,0}_\Sigma( T^\ast_{\Sigma,inf}[-1]\left( A \times_\Sigma A[-1] \right) )

given in local coordinates by

\mathbf{L}_{aux} \;\coloneqq\; \overline{c}^\ddagger_i b^i \, dvol_\Sigma \,.

This is such that the local antibracket (def. ) with this Lagrangian acts on generators as follows:

(201)

\array{ && \left\{ \mathbf{L}_{aux},- \right\} \\ \text{auxiliary field} & b^i &\mapsto& 0 \\ \text{antighost field} & \overline{c}^i &\mapsto& b^i \\ \text{antifield of auxiliary field} & b^\ddagger_i &\mapsto& - \overline{c}^\ddagger_i \\ \text{antifield of antighost field} & \overline{c}^\ddagger_i &\mapsto& 0 }

Remark

(warning on terminology)

Beware that when adjoining antifields as in def. to a Lagrangian field theory which also has ghost fields $(c^\alpha)$ adjoined (example ) then there is no relation, a priori, between

the “antighost field” $\overline{c}^i$

and

the “antifield of the ghost field” $c^\ddagger_\alpha$

In particular there is also the

“antifield of the antighost field” $\overline{c}^\ddagger_i$

The terminology and notation is maybe unfortunate but entirely established.

The following is immediate from def. , in fact this is the purpose of the definition:

Proposition

(adjoining auxiliary fields is quasi-isomorphism of BV-BRST complexes)

Let

CE\left( E/(\mathcal{G} \times_\Sigma T\Sigma)_{\delta_{EL} L \simeq 0} \right) \;=\; \left( \Omega^{0,0}_\Sigma\left( T^\ast_{\Sigma,inf}[-1]\left(E \times_\Sigma \mathcal{G}[1]\right) \times_\Sigma T \Sigma[1] \right) \;,\; d_{CE} = \underset{s}{ \underbrace{ \left\{ -\mathbf{L} + \mathbf{L}_{BRST} \,,\, - \right\} } } \;+\; d \right)

be a local BV-BRST complex of a Lagrangian field theory $(E,\mathbf{L})$ (example ).

Let moreover $A \overset{aux}{\longrightarrow} \Sigma$ be any auxiliary field bundle (def. ). Then on the fiber product of the original field bundle $E$ and the shifted gauge parameter bundle $\mathcal{G}[1]$ with the auxiliary field bundle $A$ the sum of the original BV-Lagrangian density $-\mathbf{L} + \mathbf{L}_{BRST}$ with the auxiliary Lagrangian density $\mathbf{L}_{aux}$ (200) induce a new differential graded-commutative superalgebra:

\begin{aligned} & CE\left( E/(\mathcal{G} \times_\Sigma (A \times_\Sigma A[-1]) \times_\Sigma T \Sigma)^{aux}_{\delta_{EL} L \simeq 0} \right) \\ & \coloneqq\; \left( \Omega^{0,0}_\Sigma\left( T^\ast_{\Sigma,inf}[-1] \left( E \times_\Sigma \mathcal{G}[1] \times_\Sigma \left( A \times_\Sigma A[-1]\right) \right) \times_\Sigma T \Sigma[1] \right) \;,\; d_{CE} = \underset{s}{ \underbrace{ \left\{ \left( - L + L_{BRST} + \mathbf{L}_{aux} \right) dvol_\Sigma \,,\, - \right\} } } \;+\; d \right) \end{aligned}

with generators

\array{ \text{fields} & \phi^a & E & \phi^\ddagger_a & \text{antifields} \\ \\ \text{ghost fields} & c^\alpha & \mathcal{G}[1] & c^\ddagger_\alpha & \array{ \text{antifields of} \\ \text{ghost fields} } \\ \\ \text{ auxiliary fields } & b^i & A & b^\ddagger_i & \array{ \text{antifields of} \\ \text{auxiliary fields} } \\ \\ \text{ antighost fields } & \overline{c}^i & A[-1] & \overline{c}^{\ddagger}_i & \array{ \text{antifields of} \\ \text{antighost fields} } }

Moreover, the differential graded-commutative superalgebra of auxiliary fields and their antighost fields is a contractible chain complex

\left( \Omega^{0,0}_\Sigma( A \times_{\Sigma} A[-1] ) \,,\, d_{CE} = \left\{ \overline{c}^\ddagger_i b^i \, dvol_\Sigma \,,\, - \right\} \right) \overset{\simeq_{qi}}{\longrightarrow} 0

and thus the canonical inclusion map

CE\left( E/(\mathcal{G} \times_\Sigma \times_\Sigma T \Sigma)_{\delta_{EL} L \simeq 0} \right) \overset{\phantom{AA} \simeq_{qi} \phantom{aa}}{\hookrightarrow} CE\left( E/(\mathcal{G} \times_\Sigma (A \times_\Sigma A[-1]) \times_\Sigma T \Sigma)^{aux}_{\delta_{EL} L \simeq 0} \right)

(of the original BV-BRST complex into its tensor product with that for the auxiliary fields and their antighost fields) is a quasi-isomorphism.

Proof

From (201) we read off that

the map $s_{aux} \coloneqq \left\{ \mathbf{L}_{aux},- \right\}$ is a differential (squares to zero), and the auxiliary Lagrangian density satisfies its classical master equation (remark ) strictly

$\{\mathbf{L}_{aux}, \mathbf{L}_{aux}\} = 0$
the cochain cohomology of this differential is trivial:

$H^\bullet( s_{aux} )\;=\;0$
The local antibracket of the BV-Lagrangian density with the auxiliary Lagrangian density vanishes:

$\left\{ - \mathbf{L} + \mathbf{L}_{BRST} \,,\, \mathbf{L}_{aux} \right\} \;=\; 0$

Together this implies that the sum $-\mathbf{L} + \mathbf{L}_{BRST} + \mathbf{L}_{aux}$ satisfies the classical master equation (remark )

\left\{ \left( - \mathbf{L} + \mathbf{L}_{BRST} + \mathbf{L}_{aux} \right) \,,\, \left( - \mathbf{L} + \mathbf{L}_{BRST} + \mathbf{L}_{aux} \right) \right\} \;=\; 0

and hence that

s + s_{aux} \;\coloneqq\; \left\{ - \mathbf{L} + \mathbf{L}_{BRST} + \mathbf{L}_{aux} \,,\, - \right\}

is indeed a differential; such that its cochain cohomology is identified with that of $s = \left\{-\mathbf{L} + \mathbf{L}_{BRST},-\right\}$ under the canonical inclusion map.

Remark

(gauge fixed BV-BRST field bundle)

In conclusion, we have that, given

$(E,\mathbf{L})$ a Lagrangian field theory (def. ), with field bundle $E$ (def. );
$\mathcal{G}$ a choice of gauge parameters (def. ),

hence

$\mathcal{G}[1]$ a choice of ghost fields (example );
$A$ a choice of auxiliary fields (def. ),

hence

$A[-1]$ a choice of antighost fields (def. )
$T^\ast_{\Sigma,inf}[-1](\cdots)$ the corresponding antifields (def. )
a gauge fixing Lagrangian density $\mathbf{L}_{gf}$ (def. )

then the result is a new Lagrangian field theory

\left( E_{\text{BV-BRST}}, \mathbf{L}' \right)

now with graded field bundle (remark ) the fiber product

E_{\text{BV-BRST}} \;\coloneqq\; \underset{ \array{ \text{anti-} \\ \text{fields} } }{ \underbrace{ T^\ast_{\Sigma,inf} } } \left( \underset{\text{fields}}{\underbrace{E}} \times_\Sigma \underset{ \array{ \text{ghost} \\ \text{fields} }}{\underbrace{\mathcal{G}[1]}} \times_\Sigma \underset{\array{ \text{auxiliary} \\ \text{fields} }}{\underbrace{A}} \times_{\Sigma} \underset{ \array{ \text{antighost} \\ \text{fields} } }{\underbrace{A[-1]}} \right)

and with Lagrangian density $\mathbf{L'}$ independent of the antifields, but complemented by an auxiliary Lagrangian density $\mathbf{L}'_{BRST}$ .

The key point being that $\mathbf{L}'$ admits a covariant phase space (while $\mathbf{L}$ may not), while in BV-BRST cohomology both theories still have the same gauge-invariant on-shell observables.

$\,$

Gauge fixed electromagnetic field

As an example of the general theory of BV-BRST gauge fixing above we now discuss the gauge fixing of the electromagnetic field.

Example

(Gaussian-averaged Lorenz gauge fixing of vacuum electromagnetism)

Consider the local BV-BRST complex for the free electromagnetic field on Minkowski spacetime from example :

The field bundle is $E \coloneqq T^\ast \Sigma$ and the gauge parameter bundle is $\mathcal{G} \coloneqq \Sigma \times \mathbb{R}$ . The 0-jet field coordinates are

\array{ & c^\ddagger & (a^\ddagger)^\mu & a_\mu & c \\ deg = & -2 & -1 & 0 & 1 }

the Lagrangian density is (43)

(202)

\mathbf{L}_{EM} \coloneqq \tfrac{1}{2} f_{\mu \nu} f^{\mu \nu}

and the BV-BRST differential acts as:

\array{ & &\array{ \text{BV-BRST} \\ \text{differential} }& \\ \array{ \text{ electromagnetic field } \\ \text{ ("vector potential") } } & a_\mu &\mapsto& c_{,\mu} & \text{gauge transformation} \\ \phantom{A} \\ \text{ ghost field } & c &\mapsto& 0 & \text{abelian Lie algebra} \\ \phantom{A} \\ \array{ \text{antifield of} \\ \text{electromagnetic field} } & (a^\ddagger)^\mu &\mapsto& f^{\nu \mu}_{,\nu} & \text{equations of motion} \\ \phantom{A} \\ \array{ \text{antifield of} \\ \text{ghostfield} } & c^\ddagger &\mapsto& (a^\ddagger)^\mu_{,\mu} & \text{Noether identity} \\ \phantom{A} \\ \text{Nakanishi-Lautrup field} & b &\mapsto& 0 & \text{vanishing of auxiliary fields...} \\ \phantom{A} \\ \text{antighost field} & \overline{c} &\mapsto& b & \text{... in cohomology} \\ \phantom{A} \\ \array{ \text{antifield of} \\ \text{ Nakanishi-Lautrup field } } & b^\ddagger &\mapsto& -\overline{c}^\ddagger & \text{vanishing of antifields of auxiliary fields...} \\ \phantom{A} \\ \array{ \text{antifield of} \\ \text{antighost field} } & \overline{c}^\ddagger &\mapsto& 0 & \text{... in cohomology} }

Introduce a trivial real line bundle for auxiliary fields $b$ in degree 0 and their antighost fields $\overline{c}$ (def. ) in degree -1:

\array{ & \Sigma \times \langle \overline{c}\rangle &\overset{ \overline{c} \mapsto b}{\longrightarrow}& \Sigma \times\langle b\rangle \\ deg = & -1 && 0 } \,.

In the present context the auxiliary field $b$ is called the abelian Nakanishi-Lautrup field.

The corresponding BV-BRST complex with auxiliary fields adjoined, which, by prop. , is quasi-isomorphic to the original one above, has coordinate generators

\array{ & c^\ddagger & (a^\ddagger)^\mu & a_\mu & c \\ & & \overline{c} & b \\ & & b^{\ddagger} & \overline{c}^\ddagger \\ deg = & -2 & -1 & 0 & 1 } \,.

and BV-BRST differential given by the local antibracket (def. ) with $-\mathbf{L}_{EM} + \mathbf{L}_{BRST} + \mathbf{L}_{aux}$ :

s \;=\; \left\{ \left( - \underset{ = L_{EM}}{\underbrace{\tfrac{1}{2}f_{\mu \nu} f^{\mu \nu}}} + \underset{ = L_{BRST} }{\underbrace{ c_{,\mu} (a^\ddagger)^\mu }} + \underset{ = L_{aux} }{\underbrace{ b \overline{c}^{\ddagger} }} \right) dvol_\Sigma \,,\, (-) \right\}

We say that the gauge fixing Lagrangian (def. ) for Gaussian-averaged Lorenz gauge_ for the electromagnetic field

\mathbf{L}_{gf} \;\in\; \Omega^{p+1}_\Sigma\left( E \times_\Sigma \mathcal{G}[1] \times_\Sigma A \times_\Sigma A[-1] \right) \,.

is given by (Henneaux 90 (103a))

(203)

\mathbf{L}_{gf} \;\coloneqq\; \underset{deg = -1}{ \underbrace{ \phantom{A}\overline{c}\phantom{A} }} \underset{deg = 0}{\underbrace{( b - a^{\mu}_{,\mu} )}} \, dvol_\Sigma \,.

We check that this really is a gauge fixing Lagrangian density according to def. :

From (202) and (203) we find the local antibrackets (def. ) with this gauge fixing Lagrangian density to be

\begin{aligned} \left\{\mathbf{L}_{gf}\,,\,\left( - \mathbf{L}_{EM} + \mathbf{L}_{BRST} + \mathbf{L}_{aux} \right) \right\} & = \left\{ \overline{c}\left( b - a^\mu_{,\mu}\right) \, dvol_\Sigma \,,\, \left( -\tfrac{1}{2}f_{\mu \nu}f^{\mu \nu} + c_{,\mu} (a^\ddagger)^\mu + b \overline{c}^\ddagger \right) dvol_\Sigma \right\} \\ & = \left\{ \overline{c}\left( b - a^\mu_{,\mu}\right) \, dvol_\Sigma \,,\, b \overline{c}^{\ddagger} \, dvol_\Sigma \right\} + \left\{ \overline{c}\left( b - a^\mu_{,\mu}\right) \, dvol_\Sigma \,,\, c_{,\mu} (a^{\ddagger})^\mu \, dvol_\Sigma \right\} \\ & = - \left( b ( b - a^{\mu}_{,\mu} ) + \overline{c}_{,\mu} c^{,\mu} \right) \, dvol_\Sigma \\ \phantom{A} \\ \{ \mathbf{L}_{gf}, \{ \mathbf{L}_{gf} , (-\mathbf{L} + \mathbf{L}_{BRST} + \mathbf{L}_{aux} )\}\} & = 0 \end{aligned}

(So we are in the traditional situation of example .)

Therefore the corresponding gauge fixed Lagrangian density (199) is (see also Henneaux 90 (103b)):

(204)

\begin{aligned} -\mathbf{L}' + \mathbf{L}'_{BRST} & \coloneqq e^{\left\{ \mathbf{L}_{gf} ,-\right\}}\left( -\mathbf{L}_{EM} + \mathbf{L}_{BRST} + \mathbf{L}_{aux} \right) \\ & = - \underset{ = \mathbf{L}' }{ \underbrace{ \left( \underset{ = L_{EM} }{ \underbrace{ \tfrac{1}{2} f_{\mu \nu} f^{\mu \nu} } } + \underset{ = -\left\{ L_{gf}, L_{BRST} + L_{aux} \right\} }{ \underbrace{ b ( b - a^{\mu}_{,\mu} ) + \overline{c}_{,\mu} c^{,\mu} } } \right) dvol_\Sigma } } \;+\; \underset{ = \mathbf{L}'_{BRST} }{ \underbrace{ \left( \underset{ = L_{BRST} }{ \underbrace{ c_{,\mu} (a^\ddagger)^\mu } } + \underset{ = L_{aux} }{ \underbrace{ b \overline{c}^\ddagger } } \right) dvol_\Sigma } } \end{aligned} \,.

The Euler-Lagrange equation of motion (def. ) induced by the gauge fixed Lagrangian density $\mathbf{L}'$ at antifield degree 0 are (using (64)):

(205)

\delta_{EL} \mathbf{L}' \;=\; 0 \phantom{AAA} \Leftrightarrow \phantom{AAA} \left\{ \begin{aligned} -\frac{d}{d x^\mu} f^{\mu \nu} & = b^{,\nu} \\ b & = \tfrac{1}{2} a^\mu_{,\mu} \\ c_{,\mu}{}^{,\mu} & = 0 \\ \overline{c}_{,\mu}{}^{,\mu} & = 0 \end{aligned} \right. \phantom{AAA} \Leftrightarrow \phantom{AAA} \left\{ \begin{aligned} \Box a^\mu & = 0 \\ b & = \tfrac{1}{2} div a \\ \Box c & = 0 \\ \Box \overline{c} & = 0 \end{aligned} \right.

(e.g. Rejzner 16 (7.15) and (7.16)).

(Here in the middle we show the equations as the appear directly from the Euler-Lagrange variational derivative (prop. ). The differential operator $\Box = \eta^{\mu \nu} \frac{d}{d x^\mu} \frac{d}{d x^\nu}$ on the right is the wave operator (example ) and $div$ denotes the divergence. The equivalence to the equations on the right follows from using in the first equation the derivative of the second equation on the left, which is $b^{,\nu} = \tfrac{1}{2} a^{\mu,\nu}{}_{,\mu}$ and recalling the definition of the universal Faraday tensor (30): $\frac{d}{d x^\mu} f^{\mu \nu} = \tfrac{1}{2} \left( a^{\nu,\mu}{}_{,\mu} - a^{\mu,\nu}{}_{,\mu} \right)$ .)

Now the differential equations for gauge-fixed electromagnetism on the right in (205) are nothing but the wave equations of motion of $(p+1) + 1 + 1$ free massless scalar fields (example ).

As such, by example they are a system of Green hyperbolic differential equations (def. ), hence admit Cauchy surfaces (def. ).

Therefore (204) indeed is a gauge fixing of the Lagrangian density of the electromagnetic field on Minkowski spacetime according to def. .

The gauge-fixed BRST operator induced from the gauge fixed Lagrangian density (204) acts as

(206)

\array{ & \array{ s'_{BRST} = \\ \left\{ \left( c_{,\mu} (a^\ddagger)^\mu + b \overline{c}^{\ddagger}\right) dvol_\Sigma, (-) \right\} } \\ a_\mu &\mapsto& c_{,\mu} \\ b &\mapsto& 0 \\ \overline{c} &\mapsto & b }

From this we immediately obtain the propagators for the gauge-fixed electromagnetic field:

Proposition

(photon propagator in Gaussian-averaged Lorenz gauge)

After fixing Gaussian-averaged Lorenz gauge (example ) of the electromagnetic field on Minkowski spacetime, the causal propagator (prop. ) of the combined electromagnetic field and Nakanishi-Lautrup field is of the form

\Delta^{EM, EL} \;=\; \left( \array{ \Delta^{photon} & \ast \\ \ast & \ast } \right)

with

\Delta^{photon}_{\mu \nu}(x,y) \;=\; \eta_{\mu \nu} \Delta(x,y) \,,

where

$\eta_{\mu \nu}$ is the Minkowski metric tensor (def. );
$\Delta(x,y)$ is the causal propagator of the free field theory massless real scalar field (prop. ).

Accordingly the Feynman propagator of the electromagnetic field in Gaussian-averaged Lorenz gauge is

(\Delta^{photon}_F)_{\mu \nu}(x,y) \;=\; \eta_{\mu \nu} \Delta_F(x,y) \,,

where on the right $\Delta_F(x,y)$ is the Feynman propagator of the free massless real scalar field (def. ).

This is also called the photon propagator.

Hence by prop. the distributional Fourier transform of the photon propagator is

\widehat{\Delta^{photon}_F}_{\mu \nu}(k) \;=\; \frac{1}{- k^\mu k_\mu + i 0^+} \,.

(this is a special case of Khavkine 14 (99), see also Rejzner 16, (7.20))

Proof

The Gaussian-averaged Lorenz gauge-fixed equations of motion (205) of the electromagnetic field are just $(p+1)$ uncoupled massless Klein-Gordon equations, hence wave equations (example ) for the $(p+1)$ real components of the electromagnetic field (“vector potential”)

\Box A_\mu = 0\phantom{AAAA} \mu \in \{0,1,\cdots, p\} \,.

This shows that the propoagator is proportional to that of the real scalar field.

To see that the index structure is as claimed, recall that the domain and codomain of the advanced and retarded propagators in def. is

\array{ \Gamma_\Sigma(T\Sigma) &\overset{\left( (\mathrm{G}_{\pm})_{\mu \nu} \right)}{\longrightarrow}& \Gamma_\Sigma(T^\ast \Sigma) }

corresponding to a differential operator for the equations of motion which by (64) and (205) is given by

\array{ \Gamma_\Sigma(T^\ast \Sigma) &\overset{ \eta^{-1} \circ \Box }{\longrightarrow}& \Gamma_\Sigma(T \Sigma) \\ A_\mu &\mapsto& \eta^{\mu \nu} \Box A_\nu }

Then the defining equation (93) for the advanced and retarded Green functions is, in terms of their integral kernels, the advanced and retarded propagators $\Delta_{\pm}$

\eta^{\mu' \mu} \Box \underset{y \in X}{\int} (\Delta_{\pm})_{\mu \nu}((-),y) A^{\nu}(y) \, dvol_\Sigma(x) = A^\nu(x) \,.

This shows that

(\Delta_{\pm})_{\mu \nu} \;=\; \eta_{\mu\nu} \Delta_{\pm}

with $\Delta_{\pm}$ the advanced and retarded propagator of the free real scalar field on Minkowski spacetime (prop. ), and hence

\begin{aligned} \Delta_{\mu \nu} &= (\Delta_+)_{\mu \nu} - (\Delta_-)_{\mu \nu} \\ & = \eta_{\mu \nu} (\Delta_+ - \Delta_-) \\ & = \eta_{\mu \nu} \Delta \end{aligned}

Next we compute the gauge-invariant on-shell polynomial observables of the electromagnetic field. The result will involve the following concept:

Definition

(wave polarization of linear observables of the electromagnetic field)

Consider the electromagnetic field on Minkowski spacetime $\Sigma$ , with field bundle the cotangent bundle

The space of off-shell linear observables is spanned by the point evaluation observables

e^\mu \mathbf{A}_\mu(x) \;\in\; LinObs(T^\ast \Sigma)

where

$e = (e^\mu) \in \mathbb{R}^{p,1}$ is some vector;
$x \in \mathbb{R}^{p,1}$ is some point in Minkowski spacetime
$\mathbf{A}_\mu(x) \;\colon\; A \mapsto A_\mu(x)$

is the functional which sends a section $A \in \Gamma_\Sigma(E) = \Omega^1(\Sigma)$ to its $\mu$ -component at $x$ .

After Fourier transform of distributions this is

e^\mu \widehat{\mathbf{A}}_\mu(k) \;\in\; LinObs(T^\ast \Sigma)

for $k = (k_\mu) \in (\mathbb{R}^{p,1})^\ast$ the wave vector

for $e = (e^\mu) \in \mathbb{R}^{p,1}$ the wave polarization

The linear on-shell observables are spanned by the same expressions, but subject to the condition that

{\vert k\vert}_\eta^2 = k^\mu k_\mu = 0

hence

LinObs(T^\ast \Sigma,\mathbf{L}_{EM}) \;=\; \left\langle e^\mu \widehat{\mathbf{A}}_\mu(k) \;\vert\; k^\mu k_\mu = 0 \right\rangle

We say that the space of transversally polarized linear on-shell observables is the quotient vector space

(207)

LinObs(T^\ast \Sigma,\mathbf{L}_{EM})_{trans} \;\coloneqq\; \frac{ \langle e^\mu \widehat{\mathbf{A}}_\mu(k) \;\vert\; k^\mu k_\mu = 0 \,\, \text{and} \,\, e^\mu k_\mu = 0 \rangle }{ \langle e^\mu \widehat{\mathbf{A}}_\mu(k) \;\vert\; k^\mu k_\mu = 0 \,\, \text{and} \,\, e_\mu \propto k_\mu \rangle }

of those observables whose Fourier modes involve wave polarization vectors $e$ that vanish when contracted with the wave vector $k$ , modulo those whose wave polarization vector $e$ is proportional to the wave vector.

For example if $k = (\kappa, 0, \cdots, \kappa)$ , then the corresponding space of transversal polarization vectors may be identified with $\left\{e \,\vert\, e = (0,e_1, e_2, \cdots, e_{p-1}, 0) \right\}$ .

Proposition

(BRST cohomology on linear on-shell observables of the Gaussian-averaged Lorenz gauge fixed electromagnetic field)

After fixing Gaussian-averaged Lorenz gauge (example ) of the electromagnetic field on Minkowski spacetime, the global BRST cohomology (def. ) on the Gaussian-averaged Lorenz gauge fixed (def. ) on-shell linear observables (def. ) at $deg_{gh} = 0$ (prop. ) is isomorphic to the space of transversally polarized linear observables, def. :

H^0( LinObs( T^\ast \Sigma \times_\Sigma A \times_\Sigma A[-1] \times_\Sigma \mathcal{G}[1], \mathbf{L}' ), s'_{BRST} ) \;\simeq\; LinObs( T^\ast \Sigma, \mathbf{L}_{EM})_{trans} \,.

(e.g. Dermisek 09 II-5, p. 325)

Proof

The gauge fixed BRST differential (206) acts on the Fourier modes of the linear observables (def. ) as follows

\array{ & & s'_{BRST} \\ \array{ \text{antighost} \\ \text{field} } & \widehat{\overline{\mathbf{C}}}(k) &\mapsto& \widehat{\mathbf{B}}(k) & \array{ \text{Nakanishi-Lautrup} \\ \text{field} } \\ \phantom{a} \\ &&& \underset{\text{on-shell}}{=} \tfrac{i}{2} k^\mu \widehat{\mathbf{A}}_\mu(k) & \array{ \text{Lorenz gauge} \\ \text{condition} } \\ \phantom{A} \\ \array{ \text{electromagnetic} \\ \text{field} } & e^\mu \widehat{\mathbf{A}}_\mu(k) &\mapsto& i \left(e^\mu k_\mu\right) \widehat{\mathbf{C}}(k) & \array{ \text{polarization contracted} \\ \text{with wave vector} \\ \text{times ghost field} } \\ \phantom{A} \\ \array{ \text{Nakanishi-Lautrup} \\ \text{field} } & \widehat{\mathbf{B}} &\mapsto& 0 }

This impies that the gauge fixed BRST cohomology on linear on-shell observables at $deg_{gh} = 0$ is the space of transversally polarized linear observables (def. ):

(208)

\begin{aligned} H^0(LinObs(E,\mathbf{L}_{EM}), s'_{BRST}) & = \left\langle \frac{ \left\{ e^\mu \widehat{\mathbf{A}}_{\mu}(k) \,\vert\, k^\mu k_\mu = 0 \,\,\text{and}\,\,0 = d_{BRST}\left( e^\mu \widehat{\mathbf{A}}_\mu(k) \right) = i (e^\mu k_\mu) \widehat{\mathbf{C}}(k) \right\} }{ \left\{ e^\mu \widehat{\mathbf{A}}_\mu(k) \,\vert\, k^\mu k_\mu = 0 \,\,\text{and}\,\, e^\mu \widehat{\mathbf{A}}_\mu(k) \propto s'_{BRST}( \widehat{\overline{\mathbf{C}}}(k) ) = \tfrac{i}{2} k^\mu \widehat{ \mathbf{A} }_\mu(k) \right\} } \right\rangle \\ & = \left\langle \frac{ \left\{ e^\mu \widehat{\mathbf{A}}_\mu(k) \,\vert \, k^\mu k_\mu = 0 \,\, \text{and} \,\, e^\mu k_\mu = 0 \right\} } { \left\{ e^\mu \widehat{\mathbf{A}}_\mu(k) \,\vert \, k^\mu k_\mu = 0 \,\, \text{and} \,\, e^\mu \propto k^\mu \right\} } \right\rangle \\ & = LinObs(T^\ast \Sigma,\mathbf{L}_{EM})_{trans} \end{aligned}

Here the first line is the definition of cochain cohomology (using that both $\widehat{\mathbf{B}}$ and $\widehat{\overline{\mathbf{C}}}$ are immediately seen to vanish in cohomology), the second line is spelling out the action of the BRST operator and using the on-shell relations (205) for $\widehat{\mathbf{B}}$ and the last line is by def. .

As a corollary we obtain:

Proposition

(BRST cohomology on polynomial on-shell observables of the Gaussian-averaged Lorenz gauge fixed electromagnetic field)

After fixing Gaussian-averaged Lorenz gauge (example ) of the electromagnetic field on Minkowski spacetime, the global BRST cohomology (def. ) on the Gaussian-averaged Lorenz gauge fixed (def. ) polynomial on-shell observables (def. ) at $deg_{gh} = 0$ (prop. ) is isomorphic to the distributional polynomial algebra on transversally polarized linear observables, def. :

(209)

H^0(PolyObs( T^\ast \Sigma \times_\Sigma \mathcal{G}[1] \times_\Sigma A \times_\Sigma A[-1] ,\mathbf{L}), s'_{BRST}) \;\simeq\; Sym\left( LinObs(T^\ast \Sigma,\mathbf{L}_{EM})_{trans} \right)

Proof

Generally, if $(V^\bullet,d)$ is a cochain complex over a ground field of characteristic zero (such as the real numbers in the present case) and $Sym(V^\bullet,d)$ the differential graded-symmetric algebra that it induces (this example), then

H^\bullet(Sym(V,d)) = Sym(H^\bullet(V,d)) \,.

(by this prop.).

In conclusion we finally obtain:

Proposition

(gauge-invariant polynomial on-shell observables of the free field theory electromagnetic field)

The BV-BRST cohomology on infinitesimal observables (def. ) of the free electromagnetic field on Minkowski spacetime (example ) at $deg_{gh} = 0$ is the distributional polynomial algebra in the transversally polarized linear on-shell observables, def. , as in prop. .

Proof

By the classes of quasi-isomorphisms of prop. and prop. we may equivalently compute the cohomology if the BV-BRST complex with differential $s'$ , obtained after Gaussian-averaged Lorenz gauge fixing from example . Since the equations of motion (205) are manifestly Green hyperbolic differential equations after this gauge fixing Cauchy surfaces for the equations of motion exist and hence prop. together with prop. implies that the gauge fixed BV-complex $s'_{BV}$ has its cohomology concentrated in degree zero on the on-shell observables. Therefore prop. (i.e. the collapsing of the spectral sequence for the BV/BRST bicomplex) implies that the gauge fixed BV-BRST cohomology at ghost number zero is given by the on-shell BRST-cohomology. This is characterized by prop. .

$\,$

This concludes our discussion of gauge fixing. With the covariant phase space for gauge theories obtained thereby, we may finally pass to the quantization of field theory to quantum field theory proper, in the next chapter.

Quantization

In this chapter we discuss the following topics:

Motivation from Lie theory
Geometric quantization
Moyal star products
Moyal star product as deformation quantization
Moyal star product via geometric quantization
Example: Wick algebra of normal ordered product on Kähler vector space
Star-product on regular polynomial observables in field theory

$\,$

In the previous chapters we had found the Peierls-Poisson bracket (theorem ) on the covariant phase space (prop. ) of a gauge fixed (def. ) free Lagrangian field theory (def. ).

This Poisson bracket (def. below) is a Lie bracket and hence reflects infinitesimal symmetries acting on the covariant phase space. Just as with the infinitesimal symmetries of the Lagrangian and the BRST-reduced field bundle (example ), we may hard-wire these Hamiltonian symmetries into the very geometry of the phase space by forming their homotopy quotient given by the corresponding Lie algebroid (def. ): here this is called the Poisson Lie algebroid. Its Lie integration to a finite (instead of infinitesimal) structure is called the symplectic groupoid. This is the original covariant phase space, but with its Hamiltonian flows hard-wired into its higher differential geometry (Bongers 14, section 4).

Where smooth functions on the plain covariant phase space form the commutative algebra of observables under their pointwise product (def. ), the smooth functions on this symplectic groupoid-refinement of the phase space are multiplied by the groupoid convolution product and as such become a non-commutative algebra of quantum observables. This passage from the commutative to the non-commutative algebra of observables is called quantization, here specifically geometric quantization of symplectic groupoids (Hawkins 04, Nuiten 13).

Instead of discussing this in generality, we here focus right away on the simple special case relevant for the quantization of gauge fixed free Lagrangian field theories in the next chapter.

After an informal motivation of geometric quantization from Lie theory below (for a self-contained introduction see Bongers 14), we first showcase geometric quantization by discussing how the archetypical example of quantum mechanics in the Schrödinger representation arises from the polarized action of the Poisson bracket Lie algebra (example below). With the concept of polarization thus motivated, we use this to find the polarized groupoid convolution algebra of the symplectic groupoid of a free theory (prop. below).

The result is the “Moyal-star product” (def. below). This is the exponentiation of the integral kernel of the Poisson bracket plus possibly a symmetric shift (prop. below); it turns out to be (example below) a formal deformation quantization of the original commutative pointwise product (def. below).

Below we spell out the (elementary) proofs of these statements for the case of phase spaces which are finite dimensional vector spaces. But these proofs manifestly depend only on elementary algebraic properties of polynomials and hence go through in more general contexts as long as these basic algebraic properties are retained.

In the context of free Lagrangian field theory the analogue of the formal power series algebras on a linear phase space is, a priori, the algebra of polynomial observables (def. ). These are effectively polynomials in the field observables $\mathbf{\Phi}^a(x)$ (def. ) whose coefficients, however, are distributions of several variables. By microlocal analysis, such polynomial distributions do satisfy the usual algebraic properties of ordinary polynomials if potential UV-divergences (remark ) encoded in their wave front set (def. ) vanish, according to Hörmander's criterion (prop. ).

This criterion is always met on the subspace of regular polynomial observables and hence every propagator induces a star product on these (prop. below). In particular thus the star product of the causal propagator of a gauge fixed free Lagrangian field theory is a formal deformation quantization of its algebra of regular polynomial observables (cor. below). In order to extend this to local observables one may appeal to a certain quantization freedom (prop. below) and shift the causal propagator by a symmetric contribution, such that it becomes the Wightman propagator; this is the topic of the following chapters (remark at the end below).

In conclusion, for free gauge fixed Lagrangian field theory the product in the algebra of quantum observables is given by exponentiating propagators. It is the combinatorics of these exponentiated propagator expressions that yields the hallmark structures of perturbative quantum field theory, namely the combinatorics of Wick's lemma for the Wick algebra of free fields, and the combinatorics of Feynman diagrams for the time-ordered products. This is the topic of the following chapters Free quantum fields and Scattering. Here we conclude just with discussing the finite-dimensional toy version of the normal-ordered product in the Wick algebra (example below).

$\,$

motivation from Lie theory

Quantization of course was and is motivated by experiment, hence by observation of the observable universe: it just so happens that quantum mechanics and quantum field theory correctly account for experimental observations where classical mechanics and classical field theory gives no answer or incorrect answers. A historically important example is the phenomenon called the “ultraviolet catastrophe”, a paradox predicted by classical statistical mechanics which is not observed in nature, and which is corrected by quantum mechanics.

But one may also ask, independently of experimental input, if there are good formal mathematical reasons and motivations to pass from classical mechanics to quantum mechanics. Could one have been led to quantum mechanics by just pondering the mathematical formalism of classical mechanics?

The following spells out an argument to this effect. It will work for readers with a background in modern mathematics, notably in Lie theory, and with an understanding of the formalization of classical/prequantum mechanics in terms of symplectic geometry.

So to briefly recall, a system of classical mechanics/prequantum mechanics is a phase space, formalized as a symplectic manifold $(X, \omega)$ . A symplectic manifold is in particular a Poisson manifold, which means that the algebra of functions on phase space $X$ , hence the algebra of classical observables, is canonically equipped with a compatible Lie bracket: the Poisson bracket. This Lie bracket is what controls dynamics in classical mechanics. For instance if $H \in C^\infty(X)$ is the function on phase space which is interpreted as assigning to each configuration of the system its energy – the Hamiltonian function – then the Poisson bracket with $H$ yields the infinitesimal time evolution of the system: the differential equation famous as Hamilton's equations.

Something to take notice of here is the infinitesimal nature of the Poisson bracket. Generally, whenever one has a Lie algebra $\mathfrak{g}$ , then it is to be regarded as the infinitesimal approximation to a globally defined object, the corresponding Lie group (or generally smooth group) $G$ . One also says that $G$ is a Lie integration of $\mathfrak{g}$ and that $\mathfrak{g}$ is the Lie differentiation of $G$ .

Therefore a natural question to ask is: Since the observables in classical mechanics form a Lie algebra under Poisson bracket, what then is the corresponding Lie group?

The answer to this is of course “well known” in the literature, in the sense that there are relevant monographs which state the answer. But, maybe surprisingly, the answer to this question is not (at time of this writing) a widely advertized fact that has found its way into the basic educational textbooks. The answer is that this Lie group which integrates the Poisson bracket is the “quantomorphism group”, an object that seamlessly leads to the quantum mechanics of the system.

Before we spell this out in more detail, we need a brief technical aside: of course Lie integration is not quite unique. There may be different global Lie group objects with the same Lie algebra.

The simplest example of this is already one of central importance for the issue of quantization, namely, the Lie integration of the abelian line Lie algebra $\mathbb{R}$ . This has essentially two different Lie groups associated with it: the simply connected translation group, which is just $\mathbb{R}$ itself again, equipped with its canonical additive abelian group structure, and the discrete quotient of this by the group of integers, which is the circle group

U(1) = \mathbb{R}/\mathbb{Z} \,.

Notice that it is the discrete and hence “quantized” nature of the integers that makes the real line become a circle here. This is not entirely a coincidence of terminology, but can be traced back to the heart of what is “quantized” about quantum mechanics.

Namely, one finds that the Poisson bracket Lie algebra $\mathfrak{poiss}(X,\omega)$ of the classical observables on phase space is (for $X$ a connected manifold) a Lie algebra extension of the Lie algebra $\mathfrak{ham}(X)$ of Hamiltonian vector fields on $X$ by the line Lie algebra:

\mathbb{R} \longrightarrow \mathfrak{poiss}(X,\omega) \longrightarrow \mathfrak{ham}(X) \,.

This means that under Lie integration the Poisson bracket turns into an central extension of the group of Hamiltonian symplectomorphisms of $(X,\omega)$ . And either it is the fairly trivial non-compact extension by $\mathbb{R}$ , or it is the interesting central extension by the circle group $U(1)$ . For this non-trivial Lie integration to exist, $(X,\omega)$ needs to satisfy a quantization condition which says that it admits a prequantum line bundle. If so, then this $U(1)$ -central extension of the group $Ham(X,\omega)$ of Hamiltonian symplectomorphisms exists and is called… the quantomorphism group $QuantMorph(X,\omega)$ :

U(1) \longrightarrow QuantMorph(X,\omega) \longrightarrow Ham(X,\omega) \,.

While important, for some reason this group is not very well known, which is striking because it contains a small subgroup which is famous in quantum mechanics: the Heisenberg group.

More precisely, whenever $(X,\omega)$ itself has a compatible group structure, notably if $(X,\omega)$ is just a symplectic vector space (regarded as a group under addition of vectors), then we may ask for the subgroup of the quantomorphism group which covers the (left) action of phase space $(X,\omega)$ on itself. This is the corresponding Heisenberg group $Heis(X,\omega)$ , which in turn is a $U(1)$ -central extension of the group $X$ itself:

U(1) \longrightarrow Heis(X,\omega) \longrightarrow X \,.

At this point it is worth pausing for a second to note how the hallmark of quantum mechanics has appeared as if out of nowhere simply by applying Lie integration to the Lie algebraic structures in classical mechanics:

if we think of Lie integrating $\mathbb{R}$ to the interesting circle group $U(1)$ instead of to the uninteresting translation group $\mathbb{R}$ , then the name of its canonical basis element $1 \in \mathbb{R}$ is canonically “ $i$ ”, the imaginary unit. Therefore one often writes the above central extension instead as follows:

i \mathbb{R} \longrightarrow \mathfrak{poiss}(X,\omega) \longrightarrow \mathfrak{ham}(X,\omega)

in order to amplify this. But now consider the simple special case where $(X,\omega) = (\mathbb{R}^2, d p \wedge d q)$ is the 2-dimensional symplectic vector space which is for instance the phase space of the particle propagating on the line. Then a canonical set of generators for the corresponding Poisson bracket Lie algebra consists of the linear functions $p$ and $q$ of classical mechanics textbook fame, together with the constant function. Under the above Lie theoretic identification, this constant function is the canonical basis element of $i \mathbb{R}$ , hence purely Lie theoretically it is to be called “ $i$ ”.

With this notation then the Poisson bracket, written in the form that makes its Lie integration manifest, indeed reads

[q,p] = i \,.

Since the choice of basis element of $i \mathbb{R}$ is arbitrary, we may rescale here the $i$ by any non-vanishing real number without changing this statement. If we write “ $\hbar$ ” for this element, then the Poisson bracket instead reads

[q,p] = i \hbar \,.

This is of course the hallmark equation for quantum physics, if we interpret $\hbar$ here indeed as Planck's constant. We see it arises here merely by considering the non-trivial (the interesting, the non-simply connected) Lie integration of the Poisson bracket.

This is only the beginning of the story of quantization, naturally understood and indeed “derived” from applying Lie theory to classical mechanics. From here the story continues. It is called the story of geometric quantization. We close this motivation section here by some brief outlook.

The quantomorphism group which is the non-trivial Lie integration of the Poisson bracket is naturally constructed as follows: given the symplectic form $\omega$ , it is natural to ask if it is the curvature 2-form of a $U(1)$ -principal connection $\nabla$ on complex line bundle $L$ over $X$ (this is directly analogous to Dirac charge quantization when instead of a symplectic form on phase space we consider the the field strength 2-form of electromagnetism on spacetime). If so, such a connection $(L, \nabla)$ is called a prequantum line bundle of the phase space $(X,\omega)$ . The quantomorphism group is simply the automorphism group of the prequantum line bundle, covering diffeomorphisms of the phase space (the Hamiltonian symplectomorphisms mentioned above).

As such, the quantomorphism group naturally acts on the space of sections of $L$ . Such a section is like a wavefunction, except that it depends on all of phase space, instead of just on the “canonical coordinates”. For purely abstract mathematical reasons (which we won’t discuss here, but see at motivic quantization for more) it is indeed natural to choose a “polarization” of phase space into canonical coordinates and canonical momenta and consider only those sections of the prequantum line bundle which depend only on the former. These are the actual wavefunctions of quantum mechanics, hence the quantum states. And the subgroup of the quantomorphism group which preserves these polarized sections is the group of exponentiated quantum observables. For instance in the simple case mentioned before where $(X,\omega)$ is the 2-dimensional symplectic vector space, this is the Heisenberg group with its famous action by multiplication and differentiation operators on the space of complex-valued functions on the real line.

$\,$

geometric quantization

We had seen that every Lagrangian field theory induces a presymplectic current $\Omega_{BFV}$ (def. ) on the jet bundle of its field bundle in terms of which there is a concept of Hamiltonian differential forms and Hamiltonian vector fields on the jet bundle (def. ). The concept of quantization is induced by this local phase space-structure.

In order to disentangle the core concept of quantization from the technicalities involved in fully fledged field theory, we now first discuss the finite dimensional situation.

Example

(Schrödinger representation via geometric quantization)

Consider the Cartesian space $\mathbb{R}^2$ (def. ) with canonical coordinate functions denoted $\{q,p\}$ and to be called the canonical coordinate $q$ and its canonical momentum $p$ (as in example ) and equipped with the constant differential 2-form given in in (60) by

(210)

\omega = d p \wedge d q \,.

This is closed in that $d \omega = 0$ , and invertible in that the contraction of tangent vector fields into it (def. ) is an isomorphism to differential 1-forms, and as such it is a symplectic form.

A choice of presymplectic potential for this symplectic form is

(211)

\theta \;\coloneqq\; - q \, d p

in that $d \theta = \omega$ . (Other choices are possible, notably $\theta = p \, d q$ ).

For

A \;\colon\; \mathbb{R}^2 \longrightarrow \mathbb{C}

a smooth function (an observable), we say that a Hamiltonian vector field for it (as in def. ) is a tangent vector field $v_A$ (example ) whose contraction (def. ) into the symplectic form (210) is the de Rham differential of $A$ :

(212)

\iota_{v_A} \omega = d A \,.

Consider the foliation of this phase space by constant- $q$ -slices

(213)

\Lambda_q \subset \mathbb{R}^2 \,.

These are also called the leaves of a real polarization of the phase space.

(Other choices of polarization are possible, notably the constant $p$ -slices.)

We says that a smooth function

\psi \;\colon\; \mathbb{R}^2 \longrightarrow \mathbb{C}

is polarized if its covariant derivative with connection on a bundle $i \theta$ along the leaves vanishes; which for the choice of polarization in (213) means that

\nabla_{\partial_p} \psi = 0 \phantom{AAA} \Leftrightarrow \phantom{AAA} \iota_{\partial_p} \left( d \psi + i \theta \psi \right) = 0 \,,

which in turn, for the choice of presymplectic potential in (211), means that

\frac{\partial}{\partial p} \psi - i q \psi = 0 \,.

The solutions to this differential equation are of the form

(214)

\Psi(q,p) = \psi(q) \, \exp(+ i p q)

for $\psi \colon \mathbb{R} \to \mathbb{C}$ any smooth function, now called a wave function.

This establishes a linear isomorphism between polarized smooth functions and wave functions.

By (212) we have the Hamiltonian vector fields

v_q = \partial_p \phantom{AAAA} v_p = -\partial_q \,.

The corresponding Poisson bracket is

(215)

\begin{aligned} \{q,p\} & \coloneqq \iota_{v_p} \iota_{v_q} \omega \\ & = -\iota_{\partial_q} \iota_{\partial_p} d p \wedge d q = \\ & = - 1 \end{aligned}

The action of the corresponding quantum operators $\hat q$ and $\hat p$ on the polarized functions (214) is as follows

\begin{aligned} \hat q \Psi(q,p) & = - i \nabla_{\partial_p}\Psi(q,p) + q \Psi(q,p) \\ & = -i \underset{ = 0 }{ \underbrace{ \left( \underset{ = i q \Psi(q,p) }{ \underbrace{ \frac{\partial}{\partial p} \left( \psi(q) e^{i q p} \right) } } - i q \Psi(q,p) \right) } } + q \Psi(q,p) \\ & = \left( q \psi(q) \right) e^{i q p} \end{aligned}

and

\begin{aligned} \hat p \Psi(q,p) & = i \nabla_{\partial_q} \Psi(q,p) + p \Psi(q,p) \\ & = i \frac{\partial}{\partial q} (\psi(q)e^{i q p}) + p \Psi(q,p) \\ & = \left( i \frac{\partial}{\partial q}\psi(q) \right) e^{i q p} + \underset{ = 0}{ \underbrace{ \underset{ = - p \Psi(q,p) }{ \underbrace{ \psi(q) \left( i \frac{\partial}{\partial q} e^{i q p} \right) } } + p \Psi(q,p) } } \\ & = \left( i \frac{\partial}{\partial q}\psi(q) \right) e^{i p q} \end{aligned} \,.

Hence under the identification (214) we have

\hat q \psi = q \psi \phantom{AAAA} \hat p \psi = i \frac{\partial}{\partial q} \psi \,.

This is called the Schrödinger representation of the canonical commutation relation (215).

$\,$

Moyal star products

Let $V$ be a finite dimensional vector space and let $\pi \in V \otimes V$ be an element of the tensor product (not necessarily skew symmetric at the moment).

We may canonically regard $V$ as a smooth manifold, in which case $\pi$ is canonically regarded as a constant rank-2 tensor. As such it has a canonical action by forming derivatives on the tensor product of the space of smooth functions:

\pi \;\colon\; C^\infty(V) \otimes C^\infty(V) \longrightarrow C^\infty(V) \otimes C^\infty(V) \,.

If $\{\partial_i\}$ is a linear basis for $V$ , identified, as before, with a basis for $\Gamma(T V)$ , then in this basis this operation reads

\pi(f \otimes g) \;=\; \pi^{i j} (\partial_i f) \otimes (\partial_j g) \,,

where $\partial_i f \coloneqq \frac{\partial f}{\partial x^i}$ denotes the partial derivative of the smooth function $f$ along the $i$ th coordinate, and where we use the Einstein summation convention.

For emphasis we write

\array{ C^\infty(V) \otimes C^\infty(V) &\overset{prod}{\longrightarrow}& C^\infty(V) \\ f \otimes g &\mapsto& f \cdot g }

for the pointwise product of smooth functions.

Definition

(star product induced by constant rank-2 tensor)

Given $(V,\pi)$ as above, then the star product induced by $\pi$ on the formal power series algebra $C^\infty(V) [ [\hbar] ]$ in a formal variable $\hbar$ (“Planck's constant”) with coefficients in the smooth functions on $V$ is the linear map

(-) \star_\pi (-) \;\colon\; C^\infty(V)[ [ \hbar ] ] \otimes C^\infty(V)[ [ \hbar ] ] \longrightarrow C^\infty(V)[ [\hbar] ]

given by

(-) \star_\pi (-) \;\coloneqq\; prod \circ \exp\left( \hbar \pi^{i j} \frac{\partial}{\partial x^i} \otimes \frac{\partial}{\partial x^j} \right)

Hence

f \star_\pi g \;\coloneqq\; 1 + \hbar \pi^{i j} \frac{\partial f}{\partial x^i} \cdot \frac{\partial g}{\partial x^j} + \hbar^2 \tfrac{ 1 }{2} \pi^{i j} \pi^{k l} \frac{\partial^2 f}{\partial x^{i} \partial x^{k}} \cdot \frac{\partial^2 g}{\partial x^{j} \partial x^{l}} + \cdots \,.

Example

(star product degenerating to pointwise product)

If $\pi = 0$ in def. , then the star product $\star_0 = \cdot$ is the plain pointwise product of functions.

Example

(Moyal star product)

If the tensor $\pi$ in def. is skew-symmetric, it may be regarded as a constant Poisson tensor on the smooth manifold $V$ . In this case $\star_{\tfrac{1}{2}\pi}$ is called a Moyal star product and the star-product algebra $C^\infty(V)[ [\hbar] ], \star_\pi)$ is called the Moyal deformation quantization of the Poisson manifold $(V,\pi)$ .

Proposition

(star product is associative and unital)

Given $(V,\pi)$ as above, then the star product $(-) \star_\pi (-)$ from def. is associative and unital with unit the constant function $1 \in C^\infty(V) \hookrightarrow C^\infty(V)[ [ \hbar ] ]$ .

Hence the vector space $C^\infty(V)$ equipped with the star product $\pi$ is a unital associative algebra.

Proof

Observe that the product rule of differentiation says that

\partial_i \circ prod = prod \circ ( \partial_i \otimes id \;+\; id \otimes \partial_i ) \,.

Using this we compute as follows:

\begin{aligned} & (f \star_\pi g) \star_\pi h \\ & = prod \circ \exp( \pi^{i j} \partial_i \otimes \partial_j ) \circ \left( \left( prod \circ \exp( \pi^{k l} \partial_k \otimes \partial_l ) \right) \otimes id \right) (f \otimes g \otimes g) \\ & = prod \circ \exp( \pi^{i j} \partial_i \otimes \partial_j ) \circ (prod \otimes id) \circ \left( \exp( \pi^{k l} \partial_k \otimes \partial_l ) \otimes id \right) (f \otimes g \otimes g) \\ & = prod \circ (prod \otimes id) \circ \exp( \pi^{i j} ( \partial_i \otimes id \otimes \partial_j +id \otimes \partial_i \otimes \partial_j ) \circ \exp( \pi^{k l} \partial_k \otimes \partial_l ) \otimes id (f \otimes g \otimes g) \\ & = prod \circ (prod \otimes id) \circ \exp( \pi^{i j} \partial_i \otimes id \otimes \partial_j ) \circ \exp( \pi^{i j} id \otimes \partial_i \otimes \partial_j ) \circ \exp( \pi^{k l} \partial_k \otimes \partial_l \otimes id ) (f \otimes g \otimes g) \\ & = prod_3 \circ \exp( \pi^{i j} ( \partial_i \otimes \partial_j \otimes id + \partial_i \otimes id \otimes \partial_j + id \otimes \partial_i \otimes \partial_j) ) \end{aligned}

In the last line we used that the ordinary pointwise product of functions is associative, and wrote $prod_3 \colon C^\infty(V) \otimes C^\infty(V) \otimes C^\infty(V) \to C^\infty(V)$ for the unique pointwise product of three functions.

The last expression above is manifestly independent of the choice of order of the arguments in the triple star product, and hence it is clear that an analogous computation yields

\cdots = f \star_\pi (g \star_\pi h) \,.

Proposition

(shift by symmetric contribution is isomorphism of star products)

Let $V$ be a vector space, $\pi \in V \otimes V$ a rank-2 tensor and $\alpha \in Sym(V \otimes V)$ a symmetric rank-2 tensor.

Then the linear map

\array{ C^\infty(V) &\overset{\exp\left(\tfrac{1}{2}\alpha \right)}{\longrightarrow}& C^\infty(V) \\ f &\mapsto& \exp\left( \tfrac{1}{2}\hbar \alpha^{i j} \partial_i \partial_j \right) f }

constitutes an isomorphism of star product algebras (prop. ) of the form

\exp\left(\hbar\tfrac{1}{2}\hbar\alpha \right) \;\colon\; (C^\infty(V)[ [\hbar] ], \star_{\pi}) \overset{\simeq}{\longrightarrow} (C^\infty(V))[ [\hbar] ], \star_{\pi + \alpha}) \,,

hence identifying the star product induced from $\pi$ with that induced from $\pi + \alpha$ .

In particular every star product algebra $(C^\infty(V)[ [\hbar] ],\star_\pi)$ is isomorphic to a Moyal star product algebra $\star_{\tfrac{1}{2}\pi}$ (example ) with $\tfrac{1}{2}\pi_{skew}^{i j} = \tfrac{1}{2}(\pi^{i j} - \pi^{j i})$ the skew-symmetric part of $\pi$ , this isomorphism being exhibited by the symmetric part $2\alpha^{i j} = \tfrac{1}{2}(\pi^{i j} + \pi^{j i})$ .

Proof

We need to show that

\array{ C^\infty(V)[ [\hbar] ] \otimes C^\infty(V)[ [\hbar] ] & \overset{ \exp\left( \tfrac{1}{2}\hbar \alpha \right) \otimes \exp\left( \tfrac{1}{2}\hbar \alpha \right) }{\longrightarrow}& C^\infty(V)[ [\hbar] ] \otimes C^\infty(V)[ [\hbar] ] \\ {}^{\mathllap{\star_{\pi}}}\downarrow && \downarrow^{\mathrlap{\star_{\pi + \alpha}}} \\ C^\infty(V)[ [\hbar] ] &\underset{\exp\left( \tfrac{1}{2} \alpha \right) }{\longrightarrow}& C^\infty(V)[ [\hbar] ] }

hence that

prod \circ \exp( \hbar(\pi + \alpha) ) \circ \left( \exp\left( \tfrac{1}{2}\alpha\right) \otimes \exp\left( \tfrac{1}{2}\alpha \right) \right) \;=\; \exp\left( \tfrac{1}{2}\alpha \right) \circ prod \circ \exp( \pi ) \,.

To this end, observe that the product rule of differentiation applied twice in a row implies that

\partial_i \partial_j \circ prod \;=\; prod \circ \left( (\partial_i \partial_j) \otimes id + id \otimes (\partial_i \partial_j) + \partial_i \otimes \partial_j + \partial_j \otimes \partial_i \right) \,.

Using this we compute

\begin{aligned} & \exp\left( \hbar\tfrac{1}{2}\alpha^{i j} \partial_i \partial_j \right) \circ prod \circ \exp( \hbar \pi^{i j} \partial_{i} \otimes \partial_j ) \\ & = prod \circ \exp\left( \hbar \tfrac{1}{2}\alpha^{i j} \left( (\partial_i \partial_j) \otimes id + id \otimes (\partial_i \partial_j) + \partial_i \otimes \partial_j + \partial_j \otimes \partial_i \right) \right) \circ \exp( \hbar \pi^{i j} \partial_{k} \otimes \partial_l ) \\ & = prod \circ \exp\left( \hbar (\pi^{i j} + \alpha^{i j}) \partial_i \otimes \partial_j \right) \circ \exp\left( \hbar \tfrac{1}{2} \alpha^{i j} (\partial_i \partial_j) \otimes id \hbar \tfrac{1}{2} \alpha^{i j} id \otimes (\partial_i \partial_j) \right) \\ & = prod \circ \exp\left( \hbar (\pi^{i j} + \alpha^{i j}) \partial_i \otimes \partial_j \right) \circ \left( \exp\left( \tfrac{1}{2} \hbar \alpha \right) \otimes \exp\left( \tfrac{1}{2} \hbar \alpha \right) \right) \end{aligned}

$\,$

Moyal star product as deformation quantization

Definition

(super-Poisson algebra)

A super-Poisson algebra is

a supercommutative algebra $\mathcal{A}$ (here: over the real numbers)
a bilinear function

$\{-,-\} \;\colon\; \mathcal{A} \otimes \mathcal{A} \longrightarrow A$

to be called the Poisson bracket

such that

$\{-,-\}$ is a super Lie bracket on $\mathcal{A}$ , hence it
1. is graded skew-symmetric;
2. satisfies the super-Jacobi identity;
for each $A \in \mathcal{A}$ of homogeneous degree, the operation

$\left\{ A, -\right\} \;\colon\; \mathcal{A} \longrightarrow \mathcal{A}$

is a graded derivation on $\mathcal{A}$ of the same degree as $A$ .

Definition

(formal deformation quantization)

Let $(\mathcal{A},\{-,-\})$ be a super-Poisson algebra (def. ). Then a formal deformation quantization of $(A,\{-,-\})$ is

the structure of an associative algebra on the formal power series algebra over $\mathcal{A}$ in a variable to be called $\hbar$ , hence an associative and unital product

$\mathcal{A}[ [\hbar] ] \otimes \mathcal{A}[ [\hbar] ] \longrightarrow \mathcal{A}[ [\hbar] ]$

such that for all $f,g \in \mathcal{A}$ of homogeneous degree we have

$f \star g \, mod \hbar = f g$
$f \star g - (-1)^{deg(f) deg(g)} g \star f \, \mod \hbar^2 = \hbar \{f,g\}$

meaning that

to zeroth order in $\hbar$ the star product coincides with the given commutative product on $\mathcal{A}$ ,
to first order in $\hbar$ the graded commutator of the star product coincides with the given Poisson bracket on $\mathcal{A}$ .

Example

(Moyal star product is formal deformation quantization)

Let $(V,\pi)$ be a Poisson vector space, hence a vector space $V$ , equipped with a skew-symmetric tensor $\pi \in V \wedge V$ .

Then with $V$ regarded as a smooth manifold, the algebra of smooth functions $C^\infty(X)$ (def. ) becomes a Poisson algebra (def. ) with Poisson bracket given by

\{f,g\} \;\coloneqq\; \pi^{i j} \frac{\partial f}{\partial x^i} \frac{\partial g}{\partial x^j} \,.

Moreover, for every symmetric tensor $\alpha \in V \otimes V$ , the Moyal star product associated with $\tfrac{1}{2}\pi + \alpha$

\array{ C^\infty(V)[ [\hbar] ] \otimes C^\infty(V)[ [\hbar] ] &\overset{\star_{\tfrac{1}{2}\pi + \alpha}}{\longrightarrow}& C^\infty(V)[ [\hbar] ] \\ (f,g) &\mapsto& ((-)\cdot (-)) \circ \exp\left( (\tfrac{1}{2}\pi^{i j} + \alpha^{i j}) \frac{\partial}{\partial x^i} \otimes \frac{\partial}{\partial x^j} \right\} } (f,g)

is a formal deformation quantization (def. ) of this Poisson algebra-structure.

$\,$

Moyal star product via geometric quantization of symplectic groupoid

Proposition

(integral representation of star product)

If $\pi$ skew-symmetric and invertible, in that there exists $\omega \in V^\ast \otimes V^\ast$ with $\pi^{i j}\omega_{j k} = \delta^i_k$ , and if the functions $f,g$ admit Fourier analysis (are functions with rapidly decreasing partial derivatives), then their star product (def. ) is equivalently given by the following integral expression:

\begin{aligned} \left(f \star_\pi g\right)(x) &= \frac{(det(\omega)^{2n})}{(2 \pi \hbar)^{2n} } \int e^{ \tfrac{1}{i \hbar} \omega((x - \tilde y),(x-y))} f(y) g(\tilde y) \, d^{2 n} y \, d^{2 n} \tilde y \end{aligned}

(Baker 58)

Proof

We compute as follows:

\begin{aligned} \left(f \star_\pi g\right)(x) & \coloneqq prod \circ \exp\left( \hbar \pi^{i j} \frac{\partial}{\partial x^i} \otimes \frac{\partial}{\partial x^j} \right)(f, g) \\ & = \frac{1}{(2 \pi)^{2n}} \frac{1}{(2 \pi)^{2n}} \int \int \underbrace{ e^{ i \hbar \pi(k,q) } } \underbrace{ e^{i k \cdot (x-y)} f(y) } \underbrace{ e^{i q \cdot (x- \tilde y)} g(\tilde y) } \, d^{2 n} k \, d^{2 n} q \, d^{2 n} y \, d^{2 n} \tilde y \\ & = \frac{1}{(2 \pi)^{2n}} \int \delta\left( x - \tilde y + \hbar \pi \cdot k \right) e^{i k \cdot (x-y)} f(y) g(\tilde y) \, d^{2 n} k \, d^{2 n} y \, d^{2 n} \tilde y \\ & = \frac{1}{(2 \pi)^{2n}} \int \delta\left( x - \tilde y + z \right) e^{ \tfrac{i}{\hbar} \omega(z, (x-y))} f(y) g(\tilde y) \, d^{2 n} z \, d^{2 n} y \, d^{2 n} \tilde y \\ & = \frac{(det(\pi)^{2n})}{(2 \pi \hbar)^{2n} } \int e^{\tfrac{1}{i \hbar}\omega((x - \tilde y),(x-y))} f(y) g(\tilde y) \, d^{2 n} y \, d^{2 n} \tilde y \end{aligned}

Here in the first step we expressed $f$ and $g$ both by their Fourier transform (inserting the Fourier expression of the delta distribution from this example) and used that under this transformation the partial derivative $\pi^{a b} \frac{\partial}{\partial\phi^a}{\frac{\partial}{\phi^b}}$ turns into the product with $i \pi^{i j} k_i k_j$ (this prop.). Then we identified again the Fourier-expansion of a delta distribution and finally we applied the change of integration variables $k = \tfrac{1}{\hbar}\omega \cdot z$ and then evaluated the delta distribution.

Next we express this as the groupoid convolution product of polarized sections of the symplectic groupoid. To this end, we first need the following definnition:

Definition

(symplectic groupoid of symplectic vector space)

Assume that $\pi$ is the inverse of a symplectic form $\omega$ on $\mathbb{R}^{2n}$ . Then the Cartesian product

\array{ && \mathbb{R}^{2n} \times \mathbb{R}^{2n} \\ & {}^{\mathllap{pr_1}}\swarrow && \searrow^{\mathrlap{pr_2}} \\ \mathbb{R}^{2n} && && \mathbb{R}^{2n} }

inherits the symplectic structure

\Omega \;\coloneqq\; \left( pr_1^\ast \omega - pr_2^\ast \omega \right)

given by

\begin{aligned} \Omega & = \omega_{i j} d x^i \wedge d x^j - \omega_{i j} d y^i \wedge d y^j \\ & = \omega_{i j} ( d x^i - d y^i ) \wedge ( d x^j + d y^j ) \end{aligned} \,.

The pair groupoid on $\mathbb{R}^{2n}$ equipped with this symplectic form on its space of morphisms is a symplectic groupoid.

A choice of potential form $\Theta$ for $\Omega$ , hence with $\Omega = d \Theta$ , is given by

\Theta \coloneqq -\omega_{i j} ( x^i + y^i ) d (x^j - y^j) )

Choosing the real polarization spanned by $\partial_{x^i} - \partial_{y^i}$ a polarized section is function $F = F(x,y)$ such that

\iota_{\partial_{x^j} - \partial_{y^j}}(d F - \tfrac{1}{i \hbar} \tfrac{1}{4} \Theta F) = 0

hence

(216)

F(x,y) = f\left( \tfrac{x + y}{2} \right) e^{ \tfrac{1}{i \hbar} \omega\left( \tfrac{x - y}{2} , \tfrac{x + y}{2} \right)} \,.

Proposition

(polarized symplectic groupoid convolution product of symplectic vector space is given by Moyal star product)

Given a symplectic vector space $(\mathbb{R}^{2n}, \omega)$ , then the groupoid convolution product on polarized sections (216) on its symplectic groupoid (def. ), given by convolution product followed by averaging (integration) over the polarization fiber, is given by the star product (def. ) for the corresponding Poisson tensor $\pi \coloneqq \omega^{-1}$ , in that

\begin{aligned} \int \int F(x,t) G(t,y) \, d^{2n} t \, d^{2n} (x-y) & = (f \star_\pi g)((x+y)/2) \end{aligned} \,.

(Weinstein 91, p. 446, Garcia-Bondia & Varilly 94, section V, Hawkins 06, example6.2)

Proof

We compute as follows:

\begin{aligned} & \int \int F(x,t) G(t,y) \, d^{2n} t \, d^{2n} (x-y) \\ & \coloneqq \int \int f((x + t)/2) g( (t + y)/2 ) e^{ \tfrac{1}{i \hbar} \tfrac{1}{4} \omega( x-t, x+t ) + \tfrac{1}{i \hbar} \tfrac{1}{4} \omega(t-y, t + y) } \, d^{2n} t \, d^{2n} (x-y) \\ & = \int \int f(t/2) g( (t - (x - y))/2 ) e^{ \tfrac{1}{i \hbar} \tfrac{1}{4} \omega( (x+y) + (x - y) - t, t ) + \tfrac{1}{i \hbar} \tfrac{1}{4} \omega(t-(x+y), t - (x-y)) } \, d^{2n} t \, d^{2n} (x-y) \\ & = \int \int f(t/2) g( \tilde t / 2) e^{ \tfrac{1}{i \hbar} \tfrac{1}{4} \omega( (x+y) - \tilde t, t ) - \tfrac{1}{i \hbar} \tfrac{1}{4} \omega((x+y)-t, \tilde t) } \, d^{2n} t \, d^{2n} \tilde t \\ & = \int \int f(t) g( \tilde t ) e^{ \tfrac{1}{i \hbar} \tfrac{1}{4} \omega( (x+y) - 2 \tilde t, 2 t ) - \tfrac{1}{ii \hbar} \tfrac{1}{4} \omega((x+y)- 2 t, 2 \tilde t) } \, d^{2n} t \, d^{2n} \tilde t \\ \\ & = \int \int f(t) g(\tilde t ) e^{ \tfrac{1}{i \hbar} \omega\left( \tfrac{1}{2}(x+y) - \tilde t, \tfrac{1}{2}(x + y) - t \right)} \, d^{2n} t \, d^{2n} \tilde t \\ & = (f \star_\omega g)((x+y)/2) \end{aligned}

The first line just unwinds the definition of polarized sections from def. , the following lines each implement a change of integration variables and finally in the last line we used prop. .

$\,$

Example: Wick algebra of normal ordered products on Kähler vector space

Definition

(Kähler vector space)

An Kähler vector space is a real vector space $V$ equipped with a linear complex structure $J$ as well as two bilinear forms $\omega, g \;\colon\; V \otimes_{\mathbb{R}} V \longrightarrow \mathbb{R}$ such that the following equivalent conditions hold:

$\omega(J v, J w) = \omega(v,w)$ and $g(v,w) = \omega(v,J w)$ ;
with $V$ regarded as a smooth manifold and with $\omega, g$ regarded as constant tensors, then $(V, \omega, g)$ is an almost Kähler manifold.

Example

(standard Kähler vector spaces)

Let $V \coloneqq \mathbb{R}^2$ equipped with the complex structure $J$ which is given by the canonical identification $\mathbb{R}^2 \simeq \mathbb{C}$ , hence, in terms of the canonical linear basis $(e_i)$ of $\mathbb{R}^2$ , this is

J = (J^i{}_j) \coloneqq \left( \array{ 0 & -1 \\ 1 & 0 } \right) \,.

Moreover let

\omega = (\omega_{i j}) \coloneqq \left( \array{0 & 1 \\ -1 & 0} \right)

and

g = (g_{i j}) \coloneqq \left( \array{ 1 & 0 \\ 0 & 1} \right) \,.

Then $(V, J, \omega, g)$ is a Kähler vector space (def. ).

The corresponding Kähler manifold is $\mathbb{R}^2$ regarded as a smooth manifold in the standard way and equipped with the bilinear forms $J, \omega g$ extended as constant rank-2 tensors over this manifold.

If we write

x,y \;\colon\; \mathbb{R}^2 \longrightarrow \mathbb{R}

for the standard coordinate functions on $\mathbb{R}^2$ with

z \coloneqq x + i y \;\coloneqq\; \mathbb{R}^2 \to \mathbb{C}

and

\overline{z} \coloneqq x - i y \;\coloneqq\; \mathbb{R}^2 \to \mathbb{C}

for the corresponding complex coordinates, then this translates to

\omega \in \Omega^2(\mathbb{R}^2)

being the differential 2-form given by

\begin{aligned} \omega & = d x \wedge d y \\ & = \tfrac{1}{2i} d z \wedge d \overline{z} \end{aligned}

and with Riemannian metric tensor given by

g = d x \otimes d x + d y \otimes d y \,.

The Hermitian form is given by

\begin{aligned} h & = g - i \omega \\ & = d z \otimes d \overline{z} \end{aligned}

(for more see at Kähler vector space this example).

Definition

(Wick algebra of a Kähler vector space)

Let $(\mathbb{R}^{2n},\sigma, g)$ be a Kähler vector space (def. ). Then its Wick algebra is the formal power series vector space $\mathbb{C}[ [ \mathbb{R}^{2n} ] ] [ [ \hbar ] ]$ equipped with the star product (def. ) which is given by the bilinear form

(217)

\pi \coloneqq \tfrac{i}{2} \omega^{-1} + \tfrac{1}{2} g^{-1} \,,

hence:

\begin{aligned} A_1 \star_\pi A_2 & \coloneqq ((-)\cdot (-)) \circ \exp \left( \hbar\underoverset{k_1, k_2 = 1}{2 n}{\sum}\pi^{a b} \partial_a \otimes \partial_b \right) (A_1 \otimes A_2) \\ & = A_1 \cdot A_2 + \hbar \underoverset{k_1, k_2 = 1}{2n}{\sum}\pi^{k_1 k_2}(\partial_{k_1} A_1) \cdot (\partial_{k_2} A_2) + \cdots \end{aligned}

(e.g. Collini 16, def. 1)

Proposition

(star product algebra of Kähler vector space is star-algebra)

Under complex conjugation the star product $\star_\pi$ of a Kähler vector space structure (def. ) is a star algebra in that for all $A_1, A_2 \in \mathbb{C}[ [\mathbb{R}^{2n}] ][ [\hbar] ]$ we have

\left( A_1 \star_\pi A_2 \right)^\ast \;=\; A_2^\ast \star_\pi A_1^\ast

Proof

This follows directly from that fact that in $\pi = \tfrac{i}{2} \omega^{-1} + \tfrac{1}{2} g^{-1}$ the imaginary part coincides with the skew-symmetric part, so that

\begin{aligned} (\pi^\ast)^{a b} & = -\tfrac{i}{2} (\omega^{-1})^{a b} + \tfrac{1}{2} (g^{-1})^{a b} \\ & = \tfrac{i}{2} (\omega^{-1})^{b a} + \tfrac{1}{2} (g^{-1})^{b a} \\ & = \pi^{b a} \,. \end{aligned}

Example

(Wick algebra of a single mode)

Let $V \coloneqq \mathbb{R}^2 \simeq Span(\{x,y\})$ be the standard Kähler vector space according to example , with canonical coordinates denoted $x$ and $y$ . We discuss its Wick algebra according to def. and show that this reproduces the traditional definition of products of “normal ordered” operators as above.

To that end, consider the complex linear combination of the coordinates to the canonical complex coordinates

z \;\coloneqq\; x + i y \phantom{AAA} \text{and} \phantom{AAA} \overline{z} \coloneqq x - i y

which we use in the form

a^\ast \;\coloneqq\; \tfrac{1}{\sqrt{2}}(x + i y) \phantom{AAA} \text{and} \phantom{AAA} a \;\coloneqq\; \tfrac{1}{\sqrt{2}}(x - i y)

(with “ $a$ ” the traditional symbol for the amplitude of a field mode).

Now

\omega^{-1} = \frac{\partial}{\partial y} \otimes \frac{\partial}{\partial x} - \frac{\partial}{\partial x} \otimes \frac{\partial}{\partial y}

g^{-1} = \frac{\partial}{\partial x} \otimes \frac{\partial}{\partial x} + \frac{\partial}{\partial y} \otimes \frac{\partial}{\partial y}

so that with

\frac{\partial}{\partial z} = \tfrac{1}{2} \left( \frac{\partial}{\partial x} -i \frac{\partial}{\partial y} \right) \phantom{AAAA} \frac{\partial}{\partial \overline{z}} = \frac{1}{2} \left( \frac{\partial}{\partial x} + i \frac{\partial}{\partial y} \right)

we get

\begin{aligned} \tfrac{i \hbar}{2}\omega^{-1} + \tfrac{\hbar}{2} g^{-1} & = 2 \hbar \frac{\partial}{\partial \overline{z}} \otimes \frac{\partial}{\partial z} \\ & = \hbar \frac{\partial}{\partial a} \otimes \frac{\partial }{\partial a^\ast} \end{aligned}

Using this, we find the star product

A \star_\pi B \;=\; prod \circ \exp\left( \hbar \frac{\partial}{\partial a} \otimes \frac{\partial }{\partial a^\ast} \right)

to be as follows (where we write $(-)\cdot (-)$ for the plain commutative product in the formal power series algebra):

\begin{aligned} a \star_\pi a & = a \cdot a \\ a^\ast \star_\pi a^\ast & = a^\ast \cdot a^\ast \\ a^\ast \star_\pi a & = a^\ast \cdot a \\ a \star_\pi a^\ast & = a \cdot a^\ast + \hbar \end{aligned}

and so forth, for instance

\array{ (a \cdot a ) \star_\pi (a^\ast \cdot a^\ast) & = a^\ast \cdot a^\ast \cdot a \cdot a + 4 \hbar a^\ast \cdot a + \hbar^2 }

If we instead indicate the commutative pointwise product by colons and the star product by plain juxtaposition

:f g: \;\coloneqq\; f \cdot g \phantom{AAAA} f g \;\coloneqq\; f \star_\pi

then this reads

\array{ :a a: \, :a^\ast a^\ast: & = : a^\ast a^\ast a a : + 4 \hbar \, : a^\ast a : + \hbar^2 }

This is the way the Wick algebra with its operator product $\star_\pi$ and normal-ordered product $:-:$ is traditionally presented.

$\,$

star products on regular polynomial observables in field theory

Proposition

(star products on regular polynomial observables induced from propagators)

Let $(E,\mathbf{L})$ be a free Lagrangian field theory with field bundle $E \overset{fb}{\to} \Sigma$ , and let $\Delta \in \Gamma'_\Sigma((E \boxtimes E)^\ast)$ be a distribution of two variables on field histories.

On the off-shell regular polynomial observables with a formal paramater $\hbar$ adjoined consider the bilinear map

PolyObs(E)_{reg}[ [\hbar] ] \otimes PolyObs(E)_{reg} [ [ \hbar ] ] \overset{\star_{\Delta}}{\longrightarrow} PolyObs(E)_{reg}[ [\hbar] ]

given as in def. , but with partial derivatives replaced by functional derivatives

A_1 \star_{\Delta} A_2 \;\coloneqq\; ((-)\cdot(-)) \circ \exp\left( \int_\Sigma \Delta^{a b}(x,y) \frac{\delta}{\delta \Phi^a(x)} \otimes \frac{\delta}{\delta \Phi^b(y)} \right) (A_1 \otimes A_2)

As in prop. this defines a unital and associative algebra structure.

If the Euler-Lagrange equations of motion $P\Phi ) = 0$ induced by the Lagrangian density $\mathbf{L}$ are Green hyperbolic differential equations and if $\Delta$ is a homogeneous propagator for these differential equations in that $P \Delta = 0$ , then this star product algebra descends to the on-shell regular polynomial observables

PolyObs(E,\mathbf{L})_{reg}[ [\hbar] ] \otimes PolyObs(E, \mathbf{L})_{reg} [ [ \hbar ] ] \overset{\star_{\Delta}}{\longrightarrow} PolyObs(E, \mathbf{L})_{reg}[ [\hbar] ] \,.

Proof

The proof of prop. goes through verbatim in the present case, as long as all products of distributions that appear when the propagator is multiplied with the coefficients of the polynomial observables are well-defined, in that Hörmander's criterion (prop. ) on the wave front sets (def. ) of the propagator and of these coefficients is met. But the definition the coefficients of regular polynomial observables are non-singular distributions, whose wave front set is necessarily empty (example ), so that their product of distributions is always well-defined.

Corollary

(quantization of regular polynomial observables of gauge fixed free Lagrangian field theory)

Consider a gauge fixed (def. ) free Lagrangian field theory (def. ) with BV-BRST-extended field bundle (remark )

E_{\text{BV-BRST}} \;\coloneqq\; T^\ast_{\Sigma,inf}[-1] \left( E \times_\Sigma \mathcal{G}[1] \times_\Sigma A \times_\Sigma A[-1] \right)

and with causal propagator (95)

\Delta \;\in\; \Gamma'_{\Sigma \times \Sigma}( E_{\text{BV-BRST}} \boxtimes E_{\text{BV-BRST}} ) \,.

Then the star product $\star_\Delta$ (def. ) is well-defined on off-shell (as well as on-shell) regular polynomial observables (def. )

PolyObs(E_{\text{BV-BRST}})_{reg}[ [\hbar] ] \otimes PolyObs(E_{\text{BV-BRST}})_{reg}[ [\hbar] ] \overset{\star_{\tfrac{i}{2}\Delta}}{\longrightarrow} PolyObs(E_{\text{BV-BRST}})_{reg}[ [\hbar] ]

and the resulting non-commutative algebra structure

\left( PolyObs(E_{\text{BV-BRST}})_{reg}[ [\hbar] ] \,,\, \star_\Delta \right)

is a formal deformation quantization (def. ) of the Peierls-Poisson bracket on the covariant phase space (theorem ), restricted to regular polynomial observables.

(Dito 90, Dütsch-Fredenhagen 00 Dütsch-Fredenhagen 01, Hirshfeld-Henselder 02)

Proof

As in prop. , the vanishing of the wave front set of the coefficients of the regular polynomial observables implies that all arguments go through as for star products on polynomial algebras on finite dimensional vector spaces. By theorem the causal propagator is the integral kernel of the Peierls-Poisson bracket, so that the tensor $\pi$ from the definition of the Moyal star product (example ) now is

\pi = \Delta \,.

With this the statement follows by example .

Remark

(extending quantization beyond regular polynomial observables)

While cor. provides a quantization of the regular polynomial observables of any gauge fixed free Lagrangian field theory, the regular polynomial observables are too small a subspace of that of all polynomial observables:

By example the only local observables (def. ) contained among the regular polynomial observables are the linear observables (def. ). But in general it is necessary to consider also non-linear polynomial local observables. Notably the interaction action functionals $S_{int}$ induced from interaction Lagrangian densities $\mathbf{L}_{int}$ (example ) are non-linear polynomial observables.

For example:

For quantum electrodynamics on Minkowski spacetime (example ) the adiabatically switched action functional (example ) which is the transgression of the electron-photon interaction is a cubic local observable

$S_{int} \;=\; i \underset{\Sigma}{\int} g_{sw}(x) \, (\gamma^\mu)^{\alpha}{}_\beta \, \overline{\mathbf{\Psi}}_\alpha(x) \cdot \mathbf{\Psi}^\beta(x) \cdot \mathbf{A}^a(x) \, dvol_\Sigma(x)$
For scalar field phi^n theory (example ) the adiabatically switched action functional (example ) which is the transgression of the phi^n interaction

$S_{int} \;=\; \underset{\Sigma}{\int} g_{sw} \, \underset{ n \, \text{factors} }{ \underbrace{ \mathbf{\Phi}(x) \cdot \mathbf{\Phi}(x) \cdots \mathbf{\Phi}(x) \cdot \mathbf{\Phi}(x) } } \, dvol_\Sigma(x)$

is a local observable of order $n$ .

Therefore one needs to extend the formal deformation quantization provided by corollary to a larger subspace of polynomial observables that includes at least the local observables.

But prop. characterizes the freedom in choosing a formal deformation quantization: We may shift the causal propagator by a symmetric contribution. In view of prop. and in view of of Hörmander's criterion for the product of distributions (prop. ) to be well defined, we are looking for symmetric integral kernels $H$ such that the sum

(218)

\Delta_H = \tfrac{i}{2}\Delta + H

has a smaller wave front set (def. ) than $\tfrac{i}{2}\Delta$ itself has. The smaller $WF(\tfrac{i}{2}\Delta + H)$ , the larger the subspace of polynomial observables on which the corresponding formal deformation quantization exists.

Now by prop. the Wightman propagator $\Delta_H$ is of the form (218) and by prop. its wave front set is only “half” that of the causal propagator. It turns out that $\Delta_H$ does yield a formal deformation quantization of a subspace of polynomial observables that includes all local observables: this is the Wick algebra on microcausal polynomial observables. We discuss this in detail in the chapter Free quantum fields.

With such a formal deformation quantization of the local observables free field theory in hand, we may then finally obtain also a formal deformation quantization of interacting Lagrangian field theories by perturbation theory. This we discuss in the chapters Scattering and Quantum observables.

$\,$

This concludes our discussion of some basic concepts of quantization. In the next chapter we apply this to discuss the algebra of quantum observables of free Lagrangian field theories. Further below in the chapter Quantum observables we then discuss also the quantization of the interacting Lagrangian field theories, perturbatively.

Free quantum fields

In this chapter we discuss the following topics:

Wick algebra
Time-ordered product
Operator product notation
Hadamard vacuum state
Free quantum BV-differential
Schwinger-Dyson equation

$\,$

In the previous chapter we discussed quantization of linear phase spaces, which turns the algebra of observables into a noncommutative algebra of quantum observables. Here we apply this to the covariant phase spaces of gauge fixed free Lagrangian field theories (as discussed in the chapter Gauge fixing), obtaining genuine quantum field theory for free fields.

For this purpose we first need to find a sub-algebra of all observables which is large enough to contain all local observables (such as the phi^n interaction, example below, and the electron-photon interaction, example below) but small enough for the star product deformation quantization to meet Hörmander's criterion for absence of UV-divergences (remark ). This does exist (example below): It is called the algebra of microcausal polynomial observables (def. below).

types of observables in perturbative quantum field theory:

\array{ && \text{local} \\ && & \searrow \\ \text{field} &\longrightarrow& \text{linear} &\longrightarrow& \text{microcausal} &\longrightarrow& \text{polynomial} &\longrightarrow& \text{general} \\ && & \nearrow \\ && \text{regular} }

While the star product of the causal propagator still violates Hörmander's criterion for absence of UV-divergences on microcausal polynomial observables, we have seen in the previous chapter that qantization freedom allows to shift this Poisson tensor by a symmetric contribution. By prop. such a shift is provided by passage from the causal propagator to the Wightman propagator, and by prop. this reduces the wave front set and hence the UV-singularities “by half”.

This way the deformation quantization of the Peierls-Poisson bracket exists on microcausal polynomial observables as the star product algebra induced by the Wightman propagator. The resulting non-commutative algebra of observables is called the Wick algebra (prop. below). Its algebra structure may be expressed in terms of a commutative “normal-ordered product” (def. below) and the vacuum expectation values of field observables in a canonically induced vacuum state (prop. below).

The analogous star product induced by the Feynman propagator (def. below) acts by first causal ordering its arguments and then multiplying them with the Wick algebra product (prop. below) and hence is called the time-ordered product (def. below). This is the key structure in the discussion of interacting field theory discussed in the next chapter Interacting quantum fields. Here we consider this on regular polynomial observables only, hence for averages of field observables that evaluate at distinct spacetime points. The extension of the time-ordered product to local observables is possible, but requires making choices: This is called renormalization, which we turn to in the chapter Renormalization below.

	free field algebra of quantum observables	physics terminology	maths terminology
1)	supercommutative product	$\phantom{AA} :A_1 A_2:$ normal ordered product	$\phantom{AA} A_1 \cdot A_2$ pointwise product of functionals
2)	non-commutative product (deformation induced by Poisson bracket)	$\phantom{AA} A_1 A_2$ operator product	$\phantom{AA} A_1 \star_H A_2$ star product for Wightman propagator
3)		$\phantom{AA} T(A_1 A_2)$ time-ordered product	$\phantom{AA} A_1 \star_F A_2$ star product for Feynman propagator
	perturbative expansion of 2) via 1)	Wick's lemma	Moyal product for Wightman propagator $\Delta_H$ $\begin{aligned} & A_1 \star_H A_2 = \\ & ((-)\cdot (-)) \circ \exp \left( \hbar \int (\Delta_H)^{ab}(x,y) \frac{\delta}{\delta \mathbf{\Phi}^a(x)} \otimes \frac{\delta}{\delta \mathbf{\Phi}^b(y)} \right)(A_1 \otimes A_2) \end{aligned}$
	perturbative expansion of 3) via 1)	Feynman diagrams	Moyal product for Feynman propagator $\Delta_F$ $\begin{aligned} & A_1 \star_F A_2 = \\ & ((-)\cdot (-)) \circ \exp \left( \hbar \int (\Delta_F)^{ab}(x,y) \frac{\delta}{\delta \mathbf{\Phi}^a(x)} \otimes \frac{\delta}{\delta \mathbf{\Phi}^b(y)} \right)(A_1 \otimes A_2) \end{aligned}$

While the Wick algebra with its vacuum state provides a quantization of the algebra of observables of free gauge fixed Lagrangian field theories, the possible existence of infinitesimal gauge symmetries implies that the physically relevant observables are just the gauge invariant on-shell ones, exhibited by the cochain cohomology of the BV-BRST differential $\{-S' + S'_{BRST}, (-)\}$ . Hence to complete quantization of gauge theories, the BV-BRST differential needs to be lifted to the noncommutative algebra of quantum observables – this is called BV-BRST quantization.

To do so, we may regard the gauge fixed BRST-action functional $S'_{BRST}$ as an interaction term, to be dealt with later via scattering theory, and hence consider quantization of just the free BV-differential $\{-S',(-)\}$ . One finds that this is equal to its time-ordered version $\{-S',(-)\}_{\mathcal{T}}$ (prop. below) plus a quantum correction, called the BV-operator (def. below) or BV-Laplacian (prop. below).

Applied to observables this relation is the Schwinger-Dyson equation (prop. below), which expresses the quantum-correction to the equations of motion of the free gauge field Lagrangian field theory as seen by time-ordered products of observables (example below.)

After introducing field-interactions via scattering theory in the next chapter the quantum correction to the BV-differential by the BV-operator becomes the “quantum master equation” and the Schwinger-Dyson equation becomes the “master Ward identity”. When choosing renormalization these identities become conditions to be satisfied by renormalization choices in order for the interacting quantum BV-BRST differential, and hence for gauge invariant quantum observables, to be well defined in perturbative quantum field theory of gauge theories. This we discuss below in Renormalization.

$\,$

Wick algebra

The abstract Wick algebra of a free field theory with Green hyperbolic differential equation is directly analogous to the star product-algebra induced by a finite dimensional Kähler vector space (def. ) under the following identification of the Wightman propagator with the Kähler space-structure:

Remark

(Wightman propagator as Kähler vector space-structure)

Let $(E,\mathbf{L})$ be a free Lagrangian field theory whose Euler-Lagrange equation of motion is a Green hyperbolic differential equation. Then the corresponding Wightman propagator is analogous to the rank-2 tensor on a Kähler vector space as follows:

covariant phase space of free Green hyperbolic Lagrangian field theory	finite dimensional Kähler vector space
space of field histories $\Gamma_\Sigma(E)$	$\mathbb{R}^{2n}$
symplectic form $\tau_{\Sigma_p} \Omega_{BFV}$	Kähler form $\omega$
causal propagator $\Delta$	$\omega^{-1}$
Peierls-Poisson bracket $\{A_1,A_2\} = \int \Delta^{a_1 a_2}(x_1,x_2) \frac{\delta A_1}{\delta \mathbf{\Phi}^{a_1}(x_1)} \frac{\delta A_2}{\delta \mathbf{\Phi}^{a_2}(x_2)} dvol_\Sigma(x)$	Poisson bracket
Wightman propagator $\Delta_H = \tfrac{i}{2} \Delta + H$	Hermitian form $\pi = \tfrac{i}{2}\omega^{-1} + \tfrac{1}{2}g^{-1}$

(Fredenhagen-Rejzner 15, section 3.6, Collini 16, table 2.1)

Definition

(microcausal polynomial observables)

Let $E \overset{fb}{\to} \Sigma$ be a field bundle which is a vector bundle, over some spacetime $\Sigma$ .

A polynomial observable (def. )

\begin{aligned} A & = \phantom{+} \alpha^{(0)} \\ & \phantom{=} + \int_{\Sigma} \mathbf{\Phi}^a(x) \alpha^{(1)}_a(x) \, dvol_\Sigma(x) \\ & \phantom{=} + \int_{\Sigma^2} \mathbf{\Phi}^{a_1}(x_1) \cdot \mathbf{\Phi}^{a_2}(x_2) \alpha^{(2)}_{a_1 a_2}(x_1, x_2) \, dvol_\Sigma(x_1) dvol_\Sigma(x_2) \\ & \phantom{=} + \int_{\Sigma^3} \mathbf{\Phi}^{a_1}(x_1) \cdot \mathbf{\Phi}^{a_2}(x_2) \cdot \mathbf{\Phi}^{a_3}(x_3) \alpha^{(3)}_{a_1 a_2 a_3}(x_1,x_2,x_3) \, dvol_\Sigma(x_1) dvol_\Sigma(x_2) dvol_\Sigma(x^3) \\ & \phantom{=} + \cdots \,. \end{aligned}

is called microcausal if each distributional coefficient

\alpha^{(k)} \;\in\; \Gamma'_{\Sigma^k}(E^{\boxtimes^k})

as above has wave front set (def. ) not containing those elements $(x_1, \cdots x_k, k_1, \cdots k_k)$ where the $k$ wave vectors are all in the closed future cone or all in the closed past cone (def. ).

We write

\array{ PolyObs(E)_{mc} &\hookrightarrow& PolyObs(E) \\ PolyObs(E,\mathbf{L})_{mc} \simeq PolyObs(E)_{mc}/im(P) &\hookrightarrow& PolyObs(E,\mathbf{L}) }

for the subspace of off-shell/on-shell microcausal polynomial observables inside all off-shell/on-shell polynomial observables.

The important point is that microcausal polynomial observables still contain all regular polynomial observables but also all polynomial local observables:

Example

(regular polynomial observables are microcausal)

Every regular polynomial observable (def. ) is microcausal (def. ).

Proof

By definition of regular polynomial observables, their coefficients are non-singular distributions and because the wave front set of non-singular distributions is empty (example )

Example

(polynomial local observables are microcausal)

Every polynomial local observable (def. ) is a microcausal polynomial observable (def. ).

Proof

For notational convenience, consider the case of the scalar field with $k = 2$ ; the general case is directly analogous. Then the local observable coming from $\phi^2$ (a phi^n interaction-term), has, regarded as a polynomial observable, the delta distribution $\delta(x_1-x_2)$ as coefficient in degree 2:

\begin{aligned} A(\Phi) & = \underset{\Sigma}{\int} g(x) (\Phi(x))^2 \,dvol_\Sigma(x) \\ & = \underset{\Sigma \times \Sigma}{\int} \underset{ = \alpha^{(2)}}{ \underbrace{ g(x_1) \delta(x_1 - x_2) }} \, \Phi(x_1) \Phi(x_2) \, dvol_\Sigma(x_1) \, dvol_\Sigma(x_2) \end{aligned} \,.

Now for $(x_1, x_2) \in \Sigma \times \Sigma$ and $\mathbb{R}^{2n} \simeq U \subset X \times X$ a chart around this point, the Fourier transform of distributions of $g \cdot \delta(-,-)$ restricted to this chart is proportional to the Fourier transform $\hat g$ of $g$ evaluated at the sum of the two covectors:

\begin{aligned} (k_1, k_2) & \mapsto \underset{\mathbb{R}^{2n}}{\int} g(x_1) \delta(x_1, x_2) e^{i (k_1 \cdot x_1 + k_2 \cdot x_2 )} \, dvol_\Sigma(x_1) \, dvol_\Sigma(x_2) \\ & \propto \hat g(k_1 + k_2) \end{aligned} \,.

Since $g$ is a plain bump function, its Fourier transform $\hat g$ is quickly decaying (according to prop. ) along $k_1 + k_2$ , as long as $k_1 + k_2 \neq 0$ . Only on the cone $k_1 + k_2 = 0$ the Fourier transform is constant, and hence in particular not decaying.

This means that the wave front set consists of the elements of the form $(x, (k, -k))$ with $k \neq 0$ . Since $k$ and $-k$ are both in the closed future cone or both in the closed past cone precisely if $k = 0$ , this situation is excluded in the wave front set and hence the distribution $g \cdot \delta(-,-)$ is microcausal.

(graphics grabbed from Khavkine-Moretti 14, p. 45)

Proposition

(Hadamard-Moyal star product on microcausal observables – abstract Wick algebra)

Let $(E,\mathbf{L})$ a free Lagrangian field theory with Green hyperbolic equations of motion $P \Phi = 0$ . Write $\Delta$ for the causal propagator and let

\Delta_H \;=\; \tfrac{i}{2}\Delta + H

be a corresponding Wightman propagator (Hadamard 2-point function).

Then the star product induced by $\Delta_H$

A \star_H A \;\coloneqq\; prod \circ \exp\left( \int_{X^2} \hbar \Delta_H^{a b}(x_1, x_2) \frac{\delta}{\delta \mathbf{\Phi}^a(x_1)} \otimes \frac{\delta}{\delta \mathbf{\Phi}^b(x_2)} dvol_g \right) (P_1 \otimes P_2)

on off-shell microcausal observables $A_1, A_2 \in \mathcal{F}_{mc}$ (def. ) is well defined in that the wave front sets involved in the products of distributions that appear in expanding out the exponential satisfy Hörmander's criterion.

Hence by the general properties of star products (prop. ) this yields a unital associative algebra structure on the space of formal power series in $\hbar$ of off-shell microcausal observables

\left( PolyObs(E)_{mc}[ [\hbar] ] \,,\, \star_H \right) \,.

This is the off-shell Wick algebra corresponding to the choice of Wightman propagator $H$ .

Moreover the image of $P$ is an ideal with respect to this algebra structure, so that it descends to the on-shell microcausal observables to yield the on-shell Wick algebra

\left( PolyObs(E,\mathbf{L})_{mc}[ [ \hbar ] ] \,,\, \star_H \right) \,.

Finally, under complex conjugation $(-)^\ast$ these are star algebras in that

\left( A_1 \star_H A_2 \right)^\ast = A_2^\ast \star_H A_1^\ast \,.

(e.g. Collini 16, p. 25-26)

Proof

By prop. the wave front set of $\Delta_H$ has all cotangents on the first variables in the closed future cone (at the given base point, which itself is on the light cone)

and hence all those on the second variables in the closed past cone.

The first variables are integrated against those of $A_1$ and the second against $A_2$ . By definition of microcausal observables (def. ), the wave front sets of $A_1$ and $A_2$ are disjoint from the subsets where all components are in the closed future cone or all components are in the closed past cone. Therefore the relevant sum of of the wave front covectors never vanishes and hence Hörmander's criterion (prop. ) for partial products of distributions of several variables (prop. ).

It remains to see that the star product $A_1 \star_H A_2$ is itself again a microcausal observable. It is clear that it is again a polynomial observable and that it respects the ideal generated by the equations of motion. That it still satisfies the condition on the wave front set follows directly from the fact that the wave front set of a product of distributions is inside the fiberwise sum of elements of the factor wave front sets (prop. , prop. ).

Finally the star algebra-structure via complex conjugation follows via remark as in prop. .

Remark

(Wick algebra is formal deformation quantization of Poisson-Peierls algebra of observables)

Let $(E,\mathbf{L})$ a free Lagrangian field theory with Green hyperbolic equations of motion $P \Phi = 0$ with causal propagator $\Delta$ and let $\Delta_H \;=\; \tfrac{i}{2}\Delta + H$ be a corresponding Wightman propagator (Hadamard 2-point function).

Then the Wick algebra $\left( PolyObs(E,\mathbf{L})_{mc}[ [\hbar] ] \,,\, \star_H \right)$ from prop. is a formal deformation quantization of the Poisson algebra on the covariant phase space given by the on-shell polynomial observables equipped with the Poisson-Peierls bracket $\{-,-\} \;\colon\; PolyObs(E,\mathbf{L})_{mc} \otimes PolyObs(E,\mathbf{L})_{mc} \to PolyObs(E,\mathbf{L})_{mc}$ in that for all $A_1, A_2 \in PolyObs(E,\mathbf{L})_{mc}$ we have

A_1 \star_H A_2 \;=\; A_1 \cdot A_2 \;mod\; \hbar

and

A_1 \star_H A_2 - A_2 \star_H a_1 \;=\; i \hbar \{A_1, A_2\} \;mod\; \hbar^2 \,.

(Dito 90, Dütsch-Fredenhagen 00, Dütsch-Fredenhagen 01, Hirshfeld-Henselder 02)

Proof

By prop. this is immediate from the general properties of the star product (example ).

Explicitly, consider, without restriction of generality, $A_1 = \int (\alpha_1)_a(x) \mathbf{\Phi}^a(x)\, dvol_\Sigma(x)$ and $A_2 = \int (\alpha_2)_a(x) \mathbf{\Phi}^a(x)\, dvol_\Sigma(x)$ be two linear observables. Then

\begin{aligned} & A_1 \star_H A_2 \\ & = \phantom{+} A_1 A_2 \\ & \phantom{=} + \hbar \int \left( \tfrac{i}{2} \Delta^{a_1 a_2}(x_1, x_2) + H^{a_1 a_2}(x_1,x_2) \right) \frac{\partial A_1}{\partial \mathbf{\Phi}^{a_1}(x_1)} \frac{\partial A_2}{\partial \mathbf{\Phi}^{a_2}(x_2)} \;mod\; \hbar^2 \\ & = \phantom{+} A_1 A_2 \\ & \phantom{=} + \hbar \left( \int (\alpha_1)_{a_1}(x_1) \left( \tfrac{i}{2}\Delta^{a_1 a_2}(x_1, x_2) + H^{a_1 a_2}(x_1, x_2) \right) (\alpha_2)_{a_2}(x_2) \right) \;mod\; \hbar^2 \end{aligned}

Now since $\Delta$ is skew-symmetric while $H$ is symmetric (prop. ) it follows that

\begin{aligned} A_1 \star_H A_2 - A_2 \star_H A_1 & = i \hbar \left( \int (\alpha_1)_{a_1}(x_1) \Delta^{a_1 a_2}(x_1, x_2) (\alpha_2)_{a_2}(x_2) \right) \;mod\; \hbar^2 \\ & = i \hbar \, \left\{ A_1, A_2\right\} \end{aligned} \,.

The right hand side is the integral kernel-expression for the Poisson-Peierls bracket, as shown in the second line.

$\,$

time-ordered product

Definition

(time-ordered product on regular polynomial observables)

Let $(E,\mathbf{L})$ be a free Lagrangian field theory over a Lorentzian spacetime and with Green-hyperbolic Euler-Lagrange differential equations; write $\Delta_S = \Delta_+ - \Delta_-$ for the induced causal propagator. Let moreover $\Delta_H = \tfrac{i}{2}\Delta_S + H$ be a compatible Wightman propagator and write $\Delta_F = \tfrac{i}{2}(\Delta_+ + \Delta_-) + H$ for the induced Feynman propagator.

Then the time-ordered product on the space of off-shell regular polynomial observable $PolyObs(E)_{reg}$ is the star product induced by the Feynman propagator (via prop. ):

\array{ PolyObs(E)_{reg}[ [\hbar] ] \otimes PolyObs(E)_{reg}[ [\hbar] ] &\overset{}{\longrightarrow}& PolyObs(E)_{reg}[ [\hbar] ] \\ (A_1, A_2) &\mapsto& \phantom{\coloneqq} A_1 \star_F A_2 }

hence

A_1 \star_F A_2 \; \coloneqq \; ((-)\cdot(-)) \circ \exp\left( \underset{\Sigma \times \Sigma}{\int} \Delta_F^{a b}(x,y) \frac{\delta}{\delta \mathbf{\Phi}^a(x)} \otimes \frac{\delta}{\delta \mathbf{\Phi}^b(y)} \, dvol_\Sigma(x) \, dvol_\Sigma(y) \right)

(Notice that this does not descend to the on-shell observables, since the Feynman propagator is not a solution to the homogeneous equations of motion.)

Proposition

(time-ordered product is indeed causally ordered Wick algebra product)

Then the time-ordered product on regular polynomial observables (def. ) is indeed a time-ordering of the Wick algebra product $\star_H$ in that for all pairs of regular polynomial observables

A_1, A_2 \in PolyObs(E)_{reg}[ [\hbar] ]

with disjoint spacetime support we have

A_1 \star_F A_2 \;=\; \left\{ \array{ A_1 \star_H A_2 &\vert& supp(A_1) {\vee\!\!\!\wedge} supp(A_2) \\ A_2 \star_H A_1 &\vert& supp(A_2) {\vee\!\!\!\wedge} supp(A_2) } \right. \,.

Here $S_1 {\vee\!\!\!\wedge} S_2$ is the causal order relation (“ $S_1$ does not intersect the past cone of $S_2$ ”). Beware that for general pairs $(S_1, S_2)$ of subsets neither $S_1 {\vee\!\!\!\wedge} S_2$ nor $S_2 {\vee\!\!\!\wedge} S_1$ .

Proof

Recall the following facts:

the advanced and retarded propagators $\Delta_{\pm}$ by definition are supported in the future cone/past cone, respectively

$supp(\Delta_{\pm}) \subset \overline{V}^{\pm}$
they turn into each other under exchange of their arguments (cor. ):

$\Delta_\pm(y,x) = \Delta_{\mp}(x,y) \,.$
the real part $H$ of the Feynman propagator, which by definition is the real part of the Wightman propagator is symmetric (by definition or else by prop. ):

$H(x,y) = H(y,x)$

Using this we compute as follows:

\begin{aligned} A_1 \underset{\Delta_{F}}{\star} A_2 \; & = A_1 \underset{\tfrac{i}{2}(\Delta_+ + \Delta_-) + H}{\star} A_2 \\ & = \left\{ \array{ A_1 \underset{\tfrac{i}{2}\Delta_+ + H}{\star} A_2 &\vert& supp(A_1) {\vee\!\!\!\wedge} supp(A_2) \\ A_1 \underset{\tfrac{i}{2}\Delta_- + H}{\star} A_2 &\vert& supp(A_2) {\vee\!\!\!\wedge} supp(A_2) } \right. \\ & = \left\{ \array{ A_1 \underset{\tfrac{i}{2}\Delta_+ + H}{\star} A_2 &\vert& supp(A_1) {\vee\!\!\!\wedge} supp(A_2) \\ A_2 \underset{\tfrac{i}{2}\Delta_+ + H}{\star} A_1 &\vert& supp(A_2) {\vee\!\!\!\wedge} supp(A_2) } \right. \\ & = \left\{ \array{ A_1 \underset{\tfrac{i}{2}(\Delta_+ - \Delta_-) + H}{\star} A_2 &\vert& supp(A_1) {\vee\!\!\!\wedge} supp(A_2) \\ A_2 \underset{\tfrac{i}{2}(\Delta_+ - \Delta_-) + H}{\star} A_1 &\vert& supp(A_2) {\vee\!\!\!\wedge} supp(A_2) } \right. \\ & = \left\{ \array{ A_1 \underset{\Delta_H}{\star} A_2 &\vert& supp(A_1) {\vee\!\!\!\wedge} supp(A_2) \\ A_2 \underset{\Delta_H}{\star} A_1 &\vert& supp(A_2) {\vee\!\!\!\wedge} supp(A_2) } \right. \end{aligned}

Proposition

(time-ordered product on regular polynomial observables isomorphic to pointwise product)

The time-ordered product on regular polynomial observables (def. ) is isomorphic to the pointwise product of observables (def. ) via the linear isomorphism

\mathcal{T} \;\colon\; PolyObs(E)_{reg}[ [\hbar] ] \longrightarrow PolyObs(E)_{reg}[ [\hbar] ]

given by

(219)

\mathcal{T}A \;\coloneqq\; \exp\left( \tfrac{1}{2} \hbar \underset{\Sigma}{\int} \Delta_F(x,y)^{a b} \frac{\delta^2}{\delta \mathbf{\Phi}^a(x) \delta \mathbf{\Phi}^b(y)} \right) A

in that

\begin{aligned} T(A_1 A_2) & \coloneqq A_1 \star_{F} A_2 \\ & = \mathcal{T}( \mathcal{T}^{-1}(A_1) \cdot \mathcal{T}^{-1}(A_2) ) \end{aligned}

hence

\array{ PolyObs(E)_{reg}[ [\hbar] ] \otimes PolyObs(E)_{reg}[ [\hbar] ] &\overset{(-)\cdot (-)}{\longrightarrow}& PolyObs(E)_{reg}[ [\hbar] ] \\ {}^{\mathllap{\mathcal{T} \otimes \mathcal{T}}}_\simeq\Big\downarrow && \Big\downarrow{}^{\mathrlap{\mathcal{T}}}_\simeq \\ PolyObs(E)_{reg}[ [\hbar] ] \otimes PolyObs(E)_{reg}[ [\hbar] ] &\overset{(-) \star_F (-)}{\longrightarrow}& PolyObs(E)_{reg}[ [\hbar] ] }

(Brunetti-Dütsch-Fredenhagen 09, (12)-(13), Fredenhagen-Rejzner 11b, (14))

Proof

Since the Feynman propagator is symmetric (prop. ), the statement is a special case of prop. .

Example

(time-ordered exponential of regular polynomial observables)

Let $V \in PolyObs_{reg, deg = 0}[ [ \hbar ] ]$ be a regular polynomial observable (def. ) of degree zero, and write

\exp(V) = 1 + V + \tfrac{1}{2!} V \cdot V + \tfrac{1}{3!} V \cdot V \cdot V + \cdots

for the exponential of $V$ with respect to the pointwise product (89).

Then the exponential $\exp_{\mathcal{T}}(V)$ of $V$ with respect to the time-ordered product $\star_F$ (def. ) is equal to the conjugation of the exponential with respect to the pointwise product by the time-ordering isomorphism $\mathcal{T}$ from prop. :

\begin{aligned} \exp_{\mathcal{T}}(V) & \coloneqq 1 + V + \tfrac{1}{2} V \star_F V + \tfrac{1}{3!} V \star_F V \star_F V + \cdots \\ & = \mathcal{T} \circ \exp(-) \circ \mathcal{T}^{-1}(V) \,. \end{aligned}

Remark

(renormalization of time-ordered product)

The time-ordered product on regular polynomial observables from prop. extends to a product on polynomial local observables (def. ), then taking values in microcausal observables (def. ):

T \;\colon\; PolyLocObs(E)^{\otimes_n}[ [\hbar] ] \longrightarrow PolyObs(E)_{mc}[ [\hbar] ] \,.

This extension is not unique. A choice of such an extension, satisfying some evident compatibility conditions, is a choice of renormalization scheme for the given perturbative quantum field theory. Every such choice corresponds to a choice of perturbative S-matrix for the theory, namely an extension of the time-ordered exponential $\exp_{\mathcal{T}}$ (example ) from regular to local observables.

This construction of perturbative quantum field theory is called causal perturbation theory. We discuss this below in the chapters Interacting quantum fields and Renormalization.

$\,$

operator product notation

Definition

(notation for operator product and normal-ordered product)

It is traditional to use the following alternative notation for the product structures on microcausal polynomial observables:

The Wick algebra-product, hence the star product $\star_H$ for the Wightman propagator (def. ), is rewritten as plain juxtaposition:

$\text{"operator product"} \phantom{AAA} A_1 A_2 \phantom{AA} \coloneqq \phantom{AA} A_1 \star_H A_1 \phantom{AAAA} \array{ \text{star product of} \\ \text{Wightman propagator} } \,.$
The pointwise product of observables (def. ) $A_1 \cdot A_2$ is equivalently written as plain juxtaposition enclosed by colons:

$\array{ \text{"normal-ordered} \\ \text{product"} } \phantom{AAAA} :A_1 A_2: \phantom{AA}\coloneqq\phantom{AA} A_1 \cdot A_2 \phantom{AAAA} \phantom{AAa}\text{pointwise product}\phantom{AAa}$
The time-ordered product, hence the star product for the Feynman propagator $\star_F$ (def. ) is equivalently written as plain juxtaposition prefixed by a “ $T$ ”

$\array{ \text{"time-ordered} \\ \text{product"} } \phantom{AAAA} T(A_1 A_2) \phantom{AA}\coloneqq\phantom{AA} A_1 \star_F A_2 \phantom{AAAA} \array{ \text{star product of} \\ \text{Feynman propagator} }$

Under representation of the Wick algebra on a Fock Hilbert space by linear operators the first product becomes the operator product, while the second becomes the operator poduct applied after suitable re-ordering, called “normal odering” of the factors.

Disregarding the Fock space-representation, which is faithful, we may still refer to these “abstract” products as the “operator product” and the “normal-ordered product”, respectively.

$\,$

Example

(phi^n interaction)

Consider phi^n theory from example . The adiabatically switched action functional (example ) which is the transgression of the phi^n interaction is the following local (hence, by example , microcausal) observable:

\begin{aligned} S_{int} & = \underset{\Sigma}{\int} \underset{ n \, \text{factors} }{ \underbrace{ \mathbf{\Phi}(x) \cdot \mathbf{\Phi}(x) \cdots \mathbf{\Phi}(x) \cdot \mathbf{\Phi}(x) } } \, dvol_\Sigma(x) \\ & = \underset{\Sigma}{\int} : \underset{ n \, \text{factors} }{ \underbrace{ \mathbf{\Phi}(x) \mathbf{\Phi}(x) \cdots \mathbf{\Phi}(x) \mathbf{\Phi}(x) } } : \, dvol_\Sigma(X) \end{aligned} \,,

Here in the first line we have the integral over a pointwise product (def. ) of $n$ field observables (example ), which in the second line we write equivalently as a normal ordered product by def. .

Example

(electron-photon interaction)

Consider the Lagrangian field theory defining quantum electrodynamics from example . The adiabatically switched action functional (example ) which is the transgression of the electron-photon interaction is the local (hence, by example , microcausal) observable

\begin{aligned} S_{int} & \coloneqq i \underset{\Sigma}{\int} g_{sw}(x) \, (\Gamma^\mu)^\alpha{}_\beta \, \overline{\mathbf{\Psi}}_\alpha(x) \cdot \mathbf{\Psi}^\beta(x) \cdot \mathbf{A}_\mu(x) \, dvol_\Sigma(x) \\ & = i \underset{\Sigma}{\int} g_{sw}(x) \, (\Gamma^\mu)^\alpha{}_\beta \, : \overline{\mathbf{\Psi}}_\alpha(x) \mathbf{\Psi}^\beta(x) \mathbf{A}_\mu(x) : \, dvol_\Sigma(x) \end{aligned} \,,

Here in the first line we have the integral over a pointwise product (def. ) of $n$ field observables (example ), which in the second line we write equivalently as a normal ordered product by def. .

(e.g. Scharf 95, (3.3.1))

$\,$

Hadamard vacuum state

Proposition

(canonical vacuum states on abstract Wick algebra)

Let $(E,\mathbf{L})$ be a free Lagrangian field theory with Green-hyperbolic Euler-Lagrange equations of motion; and let $\Delta_H$ be a compatible Wightman propagator.

For

\Phi_0 \in \Gamma_\Sigma(E)_{\delta_{EL}\mathbf{L} = 0}

any on-shell field history (i.e. solving the equations of motion), consider the function from the Wick algebra to formal power series in $\hbar$ with coefficients in the complex numbers which evaluates any microcausal polynomial observable on $\Phi_0$

\array{ PolyObs(E,\mathbf{L})_{mc}[ [[\hbar] ] &\overset{\langle -\rangle_{\Phi_0}}{\longrightarrow}& \mathbb{C}[ [\hbar] ] \\ A &\mapsto& A(\Phi_0) }

Specifically for $\Phi_0 = 0$ (which is a solution of the equations of motion by the assumption that $(E,\mathbf{L})$ defines a free field theory) this is the function

\array{ PolyObs(E,\mathbf{L})_{mc}[ [[\hbar] ] &\overset{\langle -\rangle_0}{\longrightarrow}& \mathbb{C}[ [\hbar] ] \\ \left. \begin{aligned} A & = \alpha^{(0)} \\ & \phantom{=} + \underset{\Sigma}{\int} \alpha^{(1)}_a(x) \mathbf{\Phi}^a(x) \, dvol_\Sigma(x) \\ & \phantom{=} + \cdots \end{aligned} \right\} &\mapsto& A(0) = \alpha^{(0)} }

which sends each microcausal polynomial observable to its value $A(\Phi = 0)$ on the zero field history, hence to the constant contribution $\alpha^{(0)}$ in its polynomial expansion.

The function $\langle -\rangle_0$ is

linear over $\mathbb{C}[ [\hbar] ]$ ;
real, in that for all $A \in PolyObs(E,\mathbf{L})_{mc}[ [\hbar] ]$

$\langle A^\ast \rangle = \langle A \rangle^\ast$
positive, in that for every $A \in PolyObs(E,\mathbf{L})_{mc}[ [\hbar] ]$ there exist a $c_A \in \mathbb{C}[ [\hbar] ]$ such that

$\langle A^\ast \star_H A\rangle_{\Phi_0} = c_A^\ast \cdot c_A \,,$
normalized, in that

$\langle 1\rangle_H = 1$

where $(-)^\ast$ denotes componet-wise complex conjugation.

This means that $\langle -\rangle_{0}$ is a state on the Wick star-algebra $\left( (PolyObs(E,\mathbf{L}))_{mc}[ [\hbar] ], \star_H\right)$ (prop. ). One says that

$\langle - \rangle_0$ is a Hadamard vacuum state;

and generally

$\langle - \rangle_{\Phi_0}$ is called a coherent state.

(Dütsch 18, def. 2.12, remark 2.20, def. 5.28, exercise 5.30 and equations (5.178))

Proof

The properties of linearity, reality and normalization are obvious, what requires proof is positivity. This is proven by exhibiting a representation of the Wick algebra on a Fock Hilbert space (this algebra homomorphism is Wick's lemma), with formal powers in $\hbar$ suitably taken care of, and showing that under this representation the function $\langle -\rangle_0$ is represented, degreewise in $\hbar$ , by the inner product of the Hilbert space.

Example

(operator product of two linear observables)

Let

A_i \in LinObs(E,\mathbf{L})_{mc} \hookrightarrow PolyObs(E,\mathbf{L})_{mc}

for $i \in \{1,2\}$ be two linear microcausal observables represented by distributions which in generalized function-notation are given by

A_i \;=\; \int (\alpha_i)_{a_i}(x_i) \mathbf{\Phi}^{a_i}(x_i) \, dvol_\Sigma(x_i) \,.

Then their Hadamard-Moyal star product (prop. ) is the sum of their pointwise product with their value

(220)

\langle A_1 \star_H A_2 \rangle_0 \;\coloneqq\; i \hbar \int \int (\alpha_1)_{a_1}(x_1) \Delta_H^{a_1 a_2}(x_1,x_2) (\alpha_2)_{a_2}(x_2) \,dvol_\Sigma(x_1) \,dvol_\Sigma(x_2)

in the Wightman propagator, which is the value of the Hadamard vacuum state from prop. :

A_1 \star_H A_2 \;=\; A_1 \cdot A_2 \;+\; \langle A_1 \star_H A_2 \rangle_0

In the operator product/normal-ordered product-notation of def. this reads

A_1 A_2 \;=\; :A_1 A_2: \;+\; \langle A_1 A_2\rangle \,.

Example

(Weyl relations)

Let $(E,\mathbf{L})$ a free Lagrangian field theory with Green hyperbolic equations of motion and with Wightman propagator $\Delta_H$ .

Then for

A_1, A_2 \;\in\; LinObs(E,\mathbf{L})_{mc} \hookrightarrow PolyObs(E,\mathbf{L})_{mc}

two linear microcausal observables, the Hadamard-Moyal star product (def. ) of their exponentials exhibits the Weyl relations:

e^{A_1} \star_H e^{A_2} \;=\; e^{A_1 + A_2} \; e^{\langle A_1 \star_H A_2\rangle_0}

where on the right we have the exponential of the value of the Hadamard vacuum state (prop. ) as in example .

(e.g. Dütsch 18, exercise 2.3)

Example

(Wightman propagator is 2-point function in the Hadamard vacuum state)

Let $(E,\mathbf{L})$ be a free Lagrangian field theory with Green-hyperbolic Euler-Lagrange equations of motion; and let $\Delta_H$ be a compatible Wightman propagator.

With respect to the induced Hadamard vacuum state $\langle - \rangle_0$ from prop. , the Wightman propagator $\Delta_H(x,y)$ itself is the 2-point function, namely the distributional vacuum expectation value of the operator product of two field observables:

\left\langle \mathbf{\Phi}^a(x) \star_H \mathbf{\Phi}^b(y) \right\rangle_0 \;=\; \underset{ = 0 }{ \underbrace{ \left\langle \mathbf{\Phi}(x) \cdot \mathbf{\Phi}(y) \right\rangle }} + \underset{ = \hbar \Delta^{a b}_H(x,y) }{ \underbrace{ \left \langle \hbar \underset{\Sigma \times \Sigma}{\int} \delta(x-x') \Delta^{a b}_H(x,y) \delta(y-y') \right\rangle }}

by example .

Equivalently in the operator product-notation of def. this reads:

\left\langle \mathbf{\Phi}^a(x) \mathbf{\Phi}^b(y) \right\rangle_0 \;=\; \hbar \Delta_H(x,y) \,.

Similarly:

Example

(Feynman propagator is time-ordered 2-point function in the Hadamard vacuum state)

Let $(E,\mathbf{L})$ be a free Lagrangian field theory with Green-hyperbolic Euler-Lagrange equations of motion; and let $\Delta_H$ be a compatible Wightman propagator with induced Feynman propagator $\Delta_F$ .

With respect to the induced Hadamard vacuum state $\langle - \rangle_0$ from prop. , the Feynman propagator $\Delta_F(x,y)$ itself is the time-ordered 2-point function, namely the distributional vacuum expectation value of the time-ordered product (def. ) of two field observables:

\left\langle T\left( \mathbf{\Phi}^a(x) \star_F \mathbf{\Phi}^b(y) \right) \right\rangle_0 \;=\; \underset{ = 0 }{ \underbrace{ \left\langle \mathbf{\Phi}(x) \cdot \mathbf{\Phi}(y) \right\rangle }} + \underset{ = \hbar \Delta^{a b}_H(x,y) }{ \underbrace{ \left \langle \hbar \underset{\Sigma \times \Sigma}{\int} \delta(x-x') \Delta^{a b}_F(x,y) \delta(y-y') \right\rangle }}

analogous to example .

Equivalently in the operator product-notation of def. this reads:

\left\langle T\left( \mathbf{\Phi}^a(x) \mathbf{\Phi}^b(y) \right) \right\rangle_0 \;=\; \hbar \Delta_F(x,y) \,.

propagators (i.e. integral kernels of Green functions)
for the wave operator and Klein-Gordon operator
on a globally hyperbolic spacetime such as Minkowski spacetime:

name	symbol	wave front set	as vacuum exp. value of field operators	as a product of field operators
causal propagator	$\begin{aligned}\Delta_S & = \Delta_+ - \Delta_- \end{aligned}$	$\phantom{A}\,\,\,-$	$\begin{aligned} & i \hbar \, \Delta_S(x,y) = \\ & \left\langle \;\left[\mathbf{\Phi}(x),\mathbf{\Phi}(y)\right]\; \right\rangle \end{aligned}$	Peierls-Poisson bracket
advanced propagator	$\Delta_+$		$\begin{aligned} & i \hbar \, \Delta_+(x,y) = \\ & \left\{ \array{ \left\langle \; \left[ \mathbf{\Phi}(x),\mathbf{\Phi}(y) \right] \; \right\rangle &\vert& x \geq y \\ 0 &\vert& y \geq x } \right. \end{aligned}$	future part of Peierls-Poisson bracket
retarded propagator	$\Delta_-$		$\begin{aligned} & i \hbar \, \Delta_-(x,y) = \\ & \left\{ \array{ \left\langle \; \left[\mathbf{\Phi}(x),\mathbf{\Phi}(y) \right] \; \right\rangle &\vert& y \geq x \\ 0 &\vert& x \geq y } \right. \end{aligned}$	past part of Peierls-Poisson bracket
Wightman propagator	$\begin{aligned} \Delta_H &= \tfrac{i}{2}\left( \Delta_+ - \Delta_-\right) + H\\ & = \tfrac{i}{2}\Delta_S + H \\ & = \Delta_F - i \Delta_- \end{aligned}$		$\begin{aligned} & \hbar \, \Delta_H(x,y) \\ & = \left\langle \; \mathbf{\Phi}(x) \mathbf{\Phi}(y) \; \right\rangle \\ & = \underset{ = 0 }{\underbrace{\left\langle \; : \mathbf{\Phi}(x) \mathbf{\Phi}(y) : \; \right\rangle}} \\ & \phantom{=} + \left\langle \; \left[ \mathbf{\Phi}^{(-)}(x), \mathbf{\Phi}^{(+)}(y) \right] \; \right\rangle \end{aligned}$	positive frequency of Peierls-Poisson bracket, Wick algebra-product, 2-point function $\phantom{=}$ of vacuum state $\phantom{=}$ or generally of $\phantom{=}$ Hadamard state
Feynman propagator	$\begin{aligned}\Delta_F & = \tfrac{i}{2}\left( \Delta_+ + \Delta_- \right) + H \\ & = i \Delta_D + H \\ & = \Delta_H + i \Delta_- \end{aligned}$		$\begin{aligned} & \hbar \, \Delta_F(x,y) \\ & = \left\langle \; T\left( \; \mathbf{\Phi}(x)\mathbf{\Phi}(y) \;\right) \; \right\rangle \\ & = \left\{ \array{ \left\langle \; \mathbf{\Phi}(x)\mathbf{\Phi}(x) \; \right\rangle &\vert& x \geq y \\ \left\langle \; \mathbf{\Phi}(y) \mathbf{\Phi}(x) \; \right\rangle &\vert& y \geq x } \right.\end{aligned}$	time-ordered product

(see also Kocic‘s overview: pdf)

$\,$

free quantum BV-differential

So far we have discussed the plain (graded-commutative) algebra of quantum observables of a gauged fixed free Lagrangian field theory, deforming the commutative pointwise product of observables. But after gauge fixing, the algebra of observables is not just a (graded-commutative) algebra, but carries also a differential making it a differential graded-commutative superalgebra: the global BV-differential $\{-S' + S_{BRST}, -\}$ (def. ). The gauge invariant on-shell observables are (only) the cochain cohomology of this differential. Here we discuss what becomes of this differential as we pass to the non-commutative Wick-algebra of quantum observables.

Proposition

(global BV-differential on Wick algebra)

Let $(E,\mathbf{L})$ be a free Lagrangian field theory (def. ) with gauge fixed BV-BRST Lagrangian density $-\mathbf{L}' + \mathbf{L}'_{BRST}$ (def. ) on a graded BV-BRST field bundle $E_{\text{BV-BRST}} \coloneqq T^\ast[-1]_{\Sigma,inf}(E \times_\Sigma \mathcal{G}[1] \times_{\Sigma} A \times_\Sigma A[-1])$ (remark ). Let $\Delta_H$ be a compatible Wightman propagator (def. ).

Then the global BV-differential $\{-S',(-)\}$ (def. ) restricts from polynomial observables to a linear map on microcausal polynomial observables (def. )

\{-S',(-)\} \;\colon\; PolyObs(E_{\text{BV-BRST}})_{mc}[ [\hbar] ] \longrightarrow PolyObs(E_{\text{BV-BRST}})_{mc}[ [\hbar] ]

and as such is a derivation not only for the pointwise product, but also for the product in the Wick algebra (the star product induced by the Wightman propagator):

\{-S', A_1 \star_H A_2\} \;=\; \{-S', A_1\} \star_H A_2 + A_1 \star_H \{-S', A_2\} \,.

We call $\{-S,(-)\}$ regarded as a nilpotent derivation on the Wick algebra this way the free quantum BV-differential.

(Fredenhagen-Rejzner 11b, below (37), Rejzner 11, below (5.28))

Proof

By example the action of $\{-S',(-)\}$ on polyomial observables is to replace antifield field observables by

\mathbf{\Phi}^\ddagger_a(x) \;\mapsto\; \pm (P_{A B}\mathbf{\Phi}^A)(x) \,,

where $P$ is a differential operator. By partial integration this translates to $\{-S',(-)\}$ acting by the formally adjoint differential operator $P^\ast$ (def. ) via distributional derivative on the distributional coefficients of the given polynomial observable.

Now by prop. the application of $P^\ast$ retains or shrinks the wave front set of the distributional coefficient, hence it preserves the microcausality condition (def. ). This makes $\{-S',(-)\}$ restrict to microcausal polynomial observables.

To see that $\{-S',(-)\}$ thus restricted is a derivation of the Wick algebra product, it is sufficient to see that its commutators with the Wightman propagator vanish in each argument:

\left[ \{-S',(-)\} \otimes id \;,\; \Delta_H \left( \frac{\delta}{\delta \mathbf{\Phi}} \otimes \frac{\delta}{\delta \mathbf{\Phi}} \right) \right] \;=\; 0

and

\left[ id \otimes \{-S',(-)\} \;,\; \Delta_H \left( \frac{\delta}{\delta \mathbf{\Phi}} \otimes \frac{\delta}{\delta \mathbf{\Phi}} \right) \right] \;=\; 0 \,.

Because with this we have:

\begin{aligned} \{-S', A_1 \star_H A_2\} & = \{-S',(-)\} \circ ((-)\cdot(-)) \circ \exp\left( \hbar \Delta_H\left( \frac{\delta}{\delta \mathbf{\Phi}} \otimes \frac{\delta}{\delta \mathbf{\Phi}} \right) \right) (A_1 \otimes A_2) \\ & = ((-)\cdot(-)) \circ \left( \phantom{a \atop a} \{-S',-\} \otimes id + id \otimes \{-S',(-)\} \right) \circ \exp\left( \hbar \Delta_H\left( \frac{\delta}{\delta \mathbf{\Phi}} \otimes \frac{\delta}{\delta \mathbf{\Phi}} \right) \right) (A_1 \otimes A_2) \\ & = ((-)\cdot(-)) \circ \exp\left( \hbar \Delta_H\left( \frac{\delta}{\delta \mathbf{\Phi}} \otimes \frac{\delta}{\delta \mathbf{\Phi}} \right) \right) \circ \left( \phantom{a \atop b} \{-S',-\} \otimes id + id \otimes \{-S',(-)\} \right) (A_1 \otimes A_2) \\ & = \{-S',A_1\} \star_H A_2 + A_1 \star_H \{-S', A_2\} \end{aligned}

Here in the first step we used that $\{-S',(-)\}$ is a derivation with respect to the pointwise product, by construction (def. ) and then we used the vanishing of the above commutators.

To see that these commutators indeed vanish, use that by example we have

\begin{aligned} & \left[ \{-S',(-)\} \otimes id \;,\; \Delta_H\left( \frac{\delta }{\delta \mathbf{\Phi}} \otimes \frac{\delta}{\delta \mathbf{\Phi}} \right) \right] \\ & = \left[ \underset{A}{\sum} (-1)^{deg(\phi^A)} \underset{\Sigma}{\int} (P_{A B}\mathbf{\Phi}^A)(x) \frac{\delta}{\delta \mathbf{\Phi}^\ddagger_A(x)} \otimes id \, dvol_\Sigma(x) \;\,\; \underset{\Sigma \times \Sigma}{\int} \Delta_H^{A B}(x,y) \frac{\delta}{\delta \mathbf{\Phi}^A(x)} \otimes \frac{\delta}{\delta \mathbf{\Phi}^B(y)} \, dvol_\Sigma(x) \, dvol_\Sigma(y) \right] \\ & = -\underset{a}{\sum} (-1)^{deg(\phi^A)} \underset{\Sigma \times \Sigma}{\int} \underset{ = 0 }{ \underbrace{ (P_x \Delta_H)_A{}^B(x,y) } } \frac{\delta}{\delta \mathbf{\Phi}^\ddagger_A(x)} \otimes \frac{\delta}{\delta \mathbf{\Phi}^B(y)} \, dvol_\Sigma(x) \, dvol_\Sigma(y) \\ & = 0 \end{aligned}

and similarly for the other order of the tensor products. Here the term over the brace vanishes by the fact that the Wightman propagator is a solution to the homogeneous equations of motion by prop. .

To analyze the behaviour of the free quantum BV-differential in general and specifically after passing to interacting field theory (below in chapter Interacting quantum fields) it is useful to re-express it in terms of the incarnation of the global antibracket with respect not to the pointwise product of observables, but the time-ordered product:

Definition

(time-ordered antibracket)

Then the time-ordered global antibracket on regular polynomial observables

PolyObs(E_{\text{BV-BRST}})_{reg}[ [\hbar] ] \otimes PolyObs(E_{\text{BV-BRST}})_{reg}[ [\hbar] ] \overset{\{-,-\}_{\mathcal{T}}}{\longrightarrow} PolyObs(E_{\text{BV-BRST}})_{reg}[ [\hbar] ]

is the conjugation of the global antibracket (def. ) by the time-ordering operator $\mathcal{T}$ (from prop. ):

\{-,-\}_{\mathcal{T}} \;\coloneqq\; \mathcal{T}\left(\left\{ \mathcal{T}^{-1}(-), \mathcal{T}^{-1}(-)\right\}\right)

hence

\array{ PolyObs(E_{\text{BV-BRST}})_{reg}[ [\hbar] ] \otimes PolyObs(E_{\text{BV-BRST}})_{reg}[ [\hbar] ] &\overset{\{-,-\}}{\longrightarrow}& PolyObs(E_{\text{BV-BRST}})_{reg}[ [\hbar] ] \\ {}^{\mathllap{\mathcal{T}}}_{\mathllap{\simeq}}\Big\downarrow && \Big\downarrow{}^{\mathrlap{\mathcal{T}}}_\simeq \\ PolyObs(E_{\text{BV-BRST}})_{reg}[ [\hbar] ] \otimes PolyObs(E_{\text{BV-BRST}})_{reg}[ [\hbar] ] &\overset{ \{-,-\}_{\mathcal{T}} }{\longrightarrow}& PolyObs(E_{\text{BV-BRST}})_{reg}[ [\hbar] ] }

(Fredenhagen-Rejzner 11, (27), Rejzner 11, (5.14))

Proposition

(time-ordered antibracket with gauge fixed action functional)

Then the time-ordered antibracket (def. ) with the gauge fixed BV-action functional $-S'$ (def. ) equals the conjugation of the global BV-differential with the isomorphism $\mathcal{T}$ from the pointwise to the time-ordered product of observables (from prop. )

\{-S',-\}_{\mathcal{T}} \;=\; \mathcal{T} \circ \{-S',-\} \circ \mathcal{T}^{-1} \,,

hence

\array{ PolyObs(E_{\text{BV-BRST}})_{reg}[ [\hbar] ] &\overset{ \{-S',-\} }{\longrightarrow}& PoyObs(E_{\text{BV-BRST}})_{reg}[ [\hbar] ] \\ {}^{\mathllap{\mathcal{T}}}\Big\downarrow && \Big\downarrow{}^{\mathrlap{\mathcal{T}}} \\ PolyObs(E_{\text{BV-BRST}})_{reg}[ [\hbar] ] &\overset{ \{-S',-\}_{\mathcal{T}} }{\longrightarrow}& PoyObs(E_{\text{BV-BRST}})_{reg}[ [\hbar] ] }

Proof

By the assumption that $(E,\mathbf{L})$ is a free field theory its Euler-Lagrange equations are linear in the fields, and hence $S'$ is quadratic in the fields. This means that

\mathcal{T}^{-1}S' = S' + const \,,

where the second term on the right is independent of the fields, and hence that

\{\mathcal{T}^{-1}(-S'),-\} = \{-S', - \} \,.

This implies the claim:

\begin{aligned} \{-S',-\}_{\mathcal{T}} & \coloneqq \mathcal{T}\left(\{ \mathcal{T}^{-1}(-S'), \mathcal{T}^{-1}(-) \}\right) \\ & = \mathcal{T}\left(\{ -S', \mathcal{T}^{-1}(-) \}\right) \\ & = \mathcal{T} \circ \{-S',-\} \circ \mathcal{T}^{-1} \,. \end{aligned}

Definition

(BV-operator for gauge fixed free Lagrangian field theory)

Let $(E,\mathbf{L})$ be a free Lagrangian field theory (def. ) with gauge fixed BV-BRST Lagrangian density $-\mathbf{L}' + \mathbf{L}'_{BRST}$ (def. ) on a graded BV-BRST field bundle $E_{\text{BV-BRST}} \coloneqq T^\ast[-1]_{\Sigma,inf}(E \times_\Sigma \mathcal{G}[1] \times_{\Sigma} A \times_\Sigma A[-1])$ (remark ) and with corresponding gauge-fixed global BV-BRST differential on graded regular polynomial observables

\{-S' + S'_{BRST}, -\} \;\colon\; PolyObs(E_{\text{BV-BRST}})_{reg}[ [\hbar] ] \longrightarrow PolyObs(E_{\text{BV-BRST}})_{reg}[ [\hbar] ]

(def. ).

Then the corresponding BV-operator

\Delta_{BV} \;\colon\; PolyObs(E_{\text{BV-BRST}})_{reg}[ [\hbar] ] \longrightarrow PolyObs(E_{\text{BV-BRST}})_{reg}[ [\hbar] ]

on regular polynomial observables is, up to a factor of $i \hbar$ , the difference between the free component $\{-S',-\}$ of the gauge fixed global BV differential and its time-ordered version (def. )

\Delta_{BV} \;\coloneqq\; \tfrac{1}{i \hbar} \left( \left\{ -S',- \right\}_{\mathcal{T}} - \left\{ -S',(-) \right\} \right) \,,

hence

(221)

\{-S',-\}_{\mathcal{T}} \;=\; \{-S',-\} + i \hbar \Delta_{BV} \,.

Proposition

(BV-operator in components)

If the field bundles of all fields, ghost fields and auxiliary fields are trivial vector bundles, with field/ghost-field/auxiliary-field coordinates collectively denoted $(\phi^A)$ then the BV-operator $\Delta_{BV}$ from prop. is given explicitly by

\Delta_{BV} \;=\; \underset{a}{\sum} (-1)^{deg(\Phi^A)} \underset{\Sigma}{\int} \frac{\delta}{\delta \Phi^A(x)} \frac{\delta}{\delta \Phi^{\ddagger}_A(y)} dvol_\Sigma

Since this formula exhibits a graded Laplace operator, the BV-operator is also called the BV-Laplace operator or BV-Laplacian, for short.

(Fredenhagen-Rejzner 11, (29), Rejzner 11, (5.20))

Proof

By prop. we have equivalently

i \hbar \Delta_{BV} \;=\; \mathcal{T} \circ \{-S',-\} \circ \mathcal{T}^{-1} \,-\, \{-S',-\}

and by example the second term on the right is

\begin{aligned} \left\{ -S', -\right\} & = \underset{\Sigma}{\int} j^{\infty}\left(\mathbf{\Phi}\right)^\ast \left( \frac{\overset{\leftarrow}{\delta}_{EL} L}{\delta \phi^A} \right)(x) \frac{\delta}{\delta \mathbf{\Phi}^\ddagger_A(x)} \, dvol_\Sigma(x) \\ & = \underset{a}{\sum} (-1)^{deg(\phi^A)} \underset{}{\int} (P\mathbf{\Phi})_A(x) \frac{\delta}{\delta \mathbf{\Phi}^\ddagger_A(x)} \, dvol_\Sigma(x) \end{aligned}

With this we compute as follows:

(222)

\begin{aligned} \{-S',-\}_{\mathcal{T}} & = \mathcal{T} \circ \left\{ -S,-\right\} \circ \mathcal{T}^{-1} \\ & = \exp\left( \left[ \hbar \tfrac{1}{2} \Delta_F \left( \frac{\delta}{\delta \mathbf{\Phi}}, \frac{\delta}{\delta \mathbf{\Phi}} \right) \,,\, - \right] \right) \left( \{-S',-\} \right) \\ & = \{-S',-\} + \left[ \hbar \tfrac{1}{2} \Delta_F \left( \frac{\delta}{\delta \mathbf{\Phi}}, \frac{\delta}{\delta \mathbf{\Phi}} \right) \,, \{-S',-\} \right] + \underset{ = 0 }{\underbrace{\hbar^2(...)}} \\ & = \phantom{+} \left\{ -S' , -\right\} \\ & \phantom{=} + \left[ \tfrac{1}{2}\hbar \underset{\Sigma \times \Sigma}{\int} \Delta_F^{A B}(x,y) \frac{\delta^2}{\delta \mathbf{\Phi}^A(x) \delta \mathbf{\Phi}^B(y)} \, dvol_\Sigma(x) \, dvol_\Sigma(y) \;,\; \underset{a}{\sum} (-1)^{deg(\phi^A)} \underset{\Sigma}{\int} (P\mathbf{\Phi})_A(x) \frac{\delta}{\delta \mathbf{\Phi}^\ddagger_A(x)} \, dvol_\Sigma(x) \right] \\ & = \left\{ -S', -\right\} \\ & \phantom{=} + \underset{A}{\sum} (-1)^{deg(\phi^A)} \underset{\Sigma \times \Sigma}{\int} \underset{ = i \delta(x-y) }{\underbrace{P_x \Delta_F(x,y)}} \frac{\delta}{\delta \mathbf{\Phi}^A(x)} \frac{\delta}{\delta \mathbf{\Phi}^\ddagger_A(y)} \, dvol_\Sigma(x) \, dvol_\Sigma(y) \\ & = \left\{ -S', -\right\} + i \hbar \underset{A}{\sum} (-1)^{deg(\phi^A)} \underset{\Sigma}{\int} \frac{\delta}{\delta \mathbf{\Phi}^A(x)} \frac{\delta}{\delta \mathbf{\Phi}^\ddagger_A(x)} \, dvol_\Sigma(x) \end{aligned}

Here we used

under the first brace that by assumption of a free field theory, $\{-S',-\}$ is linear in the fields, so that the first commutator with the Feynman propagator is independent of the fields, and hence all the higher commutators vanish;
under the second brace that the Feynman propagator is $+i$ times the Green function for the Green hyperbolic Euler-Lagrange equations of motion (cor. ).

Proposition

(global antibracket exhibits failure of BV-operator to be a derivation)

The BV-operator $\Delta_{BV}$ (def. ) and the global antibracket $\{-,-\}$ (def. ) satisfy for all polynomial observables (def. ) $A_1, A_2 \in PolyObs(E_{\text{BV-BRST}})[ [\hbar] ]$ the relation

(223)

\{A_1, A_2\} \;=\; (-1)^{deg(A_2)} \, \Delta_{BV}(A_1 \cdot A_2) - (-1)^{deg(A_2)} \, \Delta_{BV}(A_1) \cdot A_2 - A_1 \cdot \Delta_{BV}(A_2)

for $(-) \cdot (-)$ the pointwise product of observables (def. ).

Moreover, it commutes on regular polynomial observables with the time-ordering operator $\mathcal{T}$ (prop. )

\Delta_{BV} \circ \mathcal{T} = \mathcal{T} \circ \Delta_{BV} \phantom{AAA} \text{on} \,\, PolyObs(E_{\text{BV-BRST}})_{reg}[ [\hbar] ]

and hence satisfies the analogue of relation (223) also for the time-ordered antibracket $\{-,-\}_{\mathcal{T}}$ (def. ) and the time-ordered product $\star_F$ on regular polynomial observables

\{A_1, A_2\}_{\mathcal{T}} \;=\; (-1)^{deg(A_2)} \, \Delta_{BV}(A_1 \star_F A_2) - (-1)^{dag(A_2)} \Delta_{BV}(A_1) \star_F A_2 - A_1 \star_F \Delta_{BV}(A_2) \,.

(e.g. Henneaux-Teitelboim 92, (15.105d))

Proof

With prop. the first statement is a graded version of the analogous relation for an ordinary Laplace operator $\Delta \coloneqq g^{a b} \partial_a \partial_b$ acting on smooth functions on Cartesian space, which on smooth functions $f,g$ satisfies

\Delta(f \cdot g) \;=\; (\nabla f, \nabla g) - \Delta(f) g - f \Delta(g) \,,

by the product law for differentiation, where now $\nabla f \coloneqq (g^{a b} \partial_b f)$ is the gradient and $(v,w) \coloneqq g_{a b} v^a w b$ the inner product. Here one just needs to carefully record the relative signs that appear.

That the BV-operator commutes with the time-ordering operator is clear from the fact that both of these are given by partial functional derivatives with constant coefficients. This immediately implies the last statement from the first.

Example

(BV-operator on time-ordered exponentials)

Let moreover $V \in PolyObs(E_{\text{BV-BRST}})_{reg, deg = 0}[ [\hbar] ]$ be a regular polynomial observable (def. ) of degree zero. Then the application of the BV-operator $\Delta_{BV}$ (def. ) to the time-ordered exponential $\exp_{\mathcal{T}}(V)$ (example ) is the time-ordered product of the time-ordered exponential with the sum of $\Delta_{BV}(V)$ and the global antibracket $\tfrac{1}{2}\{V,V\}$ of $V$ with itself:

\Delta_{BV} \left( \exp_{\mathcal{T}}(V) \right) \;=\; \left( \Delta_{BV}(V) + \tfrac{1}{2}\{V,V\} \right) \star_F \exp_{\mathcal{T}}(V)

Proof

By prop. $\Delta_{BV}$ acts as a derivation on the time-ordered product up to a correction given by the antibracket of the two factors. This yields the result by the usual combinatorics of exponentials.

\begin{aligned} & \Delta_{BV} \left( 1 + V + \tfrac{1}{2}V \star_F V + \cdots \right) \\ & = \Delta_{BV}(V) + \tfrac{1}{2}\left( \Delta_{BV}(V) \star_F V + V \star_F \Delta_{BV}(V) \right) + \tfrac{1}{2}\{V,V\} + \cdots \\ & = \Delta_{BV}(V) + \tfrac{1}{2}\{V,V\} \;+\; \Delta_{BV}(V) \star_F V + \cdots \end{aligned}

$\,$

Schwinger-Dyson equation

A special case of the general occurrence of the BV-operator is the following important property of on-shell time-ordered products:

Proposition

(Schwinger-Dyson equation)

Let

(224)

A \;\coloneqq\; \underset{\Sigma}{\int} A^a(x) \cdot \mathbf{\Phi}^\ddagger_a(x) \, dvol_\Sigma(x) \;\in\; PolyObs_{reg}(E_{\text{BV-BRST}})

be an off-shell regular polynomial observable which is linear in the antifield field observables $\mathbf{\Phi}^\ddagger$ . Then

(225)

\mathcal{T}^{\pm 1} \left( \underset{\Sigma}{\int} \frac{\delta S'}{\delta \mathbf{\Phi}^a(x)} \cdot A^a(x) \, dvol_\Sigma(x) \right) \;=\; \pm i \hbar \, \mathcal{T}^{\pm} \left( \underset{\Sigma}{\int} \frac{\delta A^a(x)}{\delta \mathbf{\Phi}^a(x)} \, dvol_\Sigma(x) \right) \phantom{A} \in \underset{ \text{on-shell} }{ \underbrace{ PolyObs_{reg}(E_{\text{BV-BRST}}, \mathbf{L'}) }} \,.

This is called the Schwinger-Dyson equation.

The following proof is due to (Rejzner 16, remark 7.7) following the informal traditional argument (Henneaux-Teitelboim 92, (15.108b)).

Proof

Applying the inverse time-ordering map $\mathcal{T}^{-1}$ (prop. ) to equation (221) applied to $A$ yields

\underset{ \mathcal{T}^{-1} \underset{\Sigma}{\int} \frac{\delta S'}{\delta \mathbf{\Phi}^a(x)} \cdot A^a(x) dvol_\Sigma(x) }{ \underbrace{ \mathcal{T}^{-1}\left\{ -S', A\right\} } } \;=\; - \underset{ i \hbar \mathcal{T}^{-1} \underset{\Sigma}{\int} \frac{\delta A^a(x)}{\delta \mathbf{\Phi}^a(x)} \, dvol_\Sigma }{ \underbrace{ i \hbar \mathcal{T}^{-1}\Delta_{BV}(A) } } + \underset{ \{-S',\mathcal{T}^{-1}(A)\} }{ \underbrace{ \mathcal{T}^{-1}\left\{ -S',A\right\}_{\mathcal{T}} } }

where we have identified the terms under the braces by 1) the component expression for the BV-differential $\{-S',-\}$ from prop. , 2) prop. and 3) prop. .

The last term is manifestly in the image of the BV-differential $\{-S',-\}$ and hence vanishes when passing to on-shell observables along the isomorphism (198)

\underset{ \text{on-shell} }{ \underbrace{ PolyObs(E_{\text{BV-BRST}}, \mathbf{L}') }} \;\simeq\; \underset{ \text{off-shell} }{ \underbrace{ PolyObs(E_{\text{BV-BRST}})_{def(af = 0)} }}/im(\{-S',-\})

(by example ).

The same argument with the replacement $\mathcal{T} \leftrightarrow \mathcal{T}^{-1}$ throughout yields the other version of the equation (with time-ordering instead of reverse time ordering and the sign of the $\hbar$ -term reversed).

Remark

(“Schwinger-Dyson operator”)

The proof of the Schwinger-Dyson equation in prop. shows that, up to time-ordering, the Schwinger-Dyson equation is the on-shell vanishing of the “quantized” BV-differential (221)

\{-S',-\}_{\mathcal{T}} \;=\; \{-S', -\} \,+\, i \hbar \, \Delta_{BV} \,,

where the BV-operator is the quantum correction of order $\hbar$ . Therefore this is also called the Schwinger-Dyson operator (Henneaux-Teitelboim 92, (15.111)).

Example

(distributional Schwinger-Dyson equation)

Often the Schwinger-Dyson equation (prop. ) is displayed before spacetime-smearing of field observables in terms of operator products of operator-valued distributions, taking the observable $A$ in (224) to be

A^a(x) \;\coloneqq\; \delta(x-x_0) \delta^a_{a_0} \mathbf{\Phi}^{a_1}(x_1) \cdots \mathbf{\Phi}^{a_n}(x_n) \,.

This choice makes (225) become the distributional Schwinger-Dyson equation

\begin{aligned} & T \left( \frac{\delta S}{\delta \mathbf{\Phi}^{a_0}(x_0)} \cdot \mathbf{\Phi}^{a_1}(x_1) \cdots \mathbf{\Phi}^{a_n}(x_n) \right) \\ & \underset{\text{on-shell}}{=} - i \hbar \underset{k}{\sum} T \left( \mathbf{\Phi}^{a_1}(x_1) \cdots \mathbf{\Phi}^{a_{k-1}}(x_{k-1}) \cdot \delta(x_0 - x_k) \delta^{a_0}_{a_k} \cdot \mathbf{\Phi}^{a_{k+1}}(x_{k+1}) \cdots \mathbf{\Phi}^{a_n}(x_m) \right) \end{aligned}

(e.q. Dermisek 09).

In particular this means that if $(x_0,a_0) \neq (x_k, a_k)$ for all $k \in \{1,\cdots ,n\}$ then

T \left( \frac{\delta S}{\delta \mathbf{\Phi}^{a_0}(x_0)} \cdot \mathbf{\Phi}^{a_1}(x_1) \cdots \mathbf{\Phi}^{a_n}(x_n) \right) \;=\; 0 \phantom{AAA} \text{on-shell}

Since by the principle of extremal action (prop. ) the equation

\frac{\delta S}{\delta \mathbf{\Phi}^{a_0}(x_0)} \;=\; 0

is the Euler-Lagrange equation of motion (for the classical field theory) “at $x_0$ ”, this may be interpreted as saying that the classical equations of motion for fields at $x_0$ still hold for time-ordered quantum expectation values, as long as all other observables are evaluated away from $x_0$ ; while if observables do coincide at $x_0$ then there is a correction measured by the BV-operator.

$\,$

This concludes our discussion of the algebra of quantum observables for free field theories. In the next chapter we discuss the perturbative QFT of interacting field theories as deformations of such free quantum field theories.

$\,$

Interacting quantum fields

In this chapter we discuss the following topics:

Free field vacua
Perturbative S-matrices
Conceptual remarks
Interacting field observables
Time-ordered products
(“Re”-)Normalization
Feynman perturbation series
Effective action
Vacuum diagrams
Interacting quantum BV-differential
Ward identities

$\,$

In the previous chapter we have found the quantization of free Lagrangian field theories by first choosing a gauge fixed BV-BRST-resolution of the algebra of gauge invariant on-shell observabes, then applying algebraic deformation quantization induced by the resulting Peierls-Poisson bracket on the graded covariant phase space to pass to a non-commutative algebra of quantum observables, such that, finally, the BV-BRST differential is respected.

Of course most quantum field theories of interest are non-free; they are interacting field theories whose equations of motion is a non-linear differential equation. The archetypical example is the coupling of the Dirac field to the electromagnetic field via the electron-photon interaction, corresponding to the interacting field theory called quantum electrodynamics (discussed below).

In principle the perturbative quantization of such non-free field theory interacting field theories proceeds the same way: One picks a BV-BRST-gauge fixing, computes the Peierls-Poisson bracket on the resulting covariant phase space (Khavkine 14) and then finds a formal deformation quantization of this Poisson structure to obtain the quantized non-commutative algebra of quantum observables, as formal power series in Planck's constant $\hbar$ .

It turns out (Collini 16, Hawkins-Rejzner 16, prop. below) that the resulting interacting formal deformation quantization may equivalently be expressed in terms of scattering amplitudes (example below): These are the probability amplitudes for plane waves of free fields to come in from the far past, then interact in a compact region of spacetime via the given interaction (adiabatically switched-off outside that region) and to emerge again as free fields into the far future.

The collection of all these scattering amplitudes, as the types and wave vectors of the incoming and outgoing free fields varies, is called the perturbative scattering matrix of the interacting field theory, or just S-matrix for short. It may equivalently be expressed as the exponential of time-ordered products of the adiabatically switched interaction action functional with itself (def. below). The combinatorics of the terms in this exponential is captured by Feynman diagrams (prop. below), which, with some care (remark below), may be thought of as finite multigraphs (def. below) whose edges are worldlines of virtual particles and whose vertices are the interactions that these particles undergo (def. below).

The axiomatic definition of S-matrices for relativistic Lagrangian field theories and their rigorous construction via ("re"-)normalization of time-ordered products (def. below) is called causal perturbation theory, due to (Epstein-Glaser 73). This makes precise and well-defined the would-be path integral quantization of interacting field theories (remark below) and removes the errors (remark below) and ensuing puzzlements (expressed in Feynman 85) that plagued the original informal conception of perturbative quantum field theory due to Schwinger-Tomonaga-Feynman-Dyson (remark below).

The equivalent re-formulation of the formal deformation quantization of interacting field theories in terms of scattering amplitudes (prop. below) has the advantage that it gives a direct handle on those observables that are measured in scattering experiments, such as the LHC-experiment. The bulk of mankind’s knowledge about realistic perturbative quantum field theory – such as notably the standard model of particle physics – is reflected in such scattering amplitudes given via their Feynman perturbation series in formal powers of Planck's constant and the coupling constant.

Moreover, the mathematical passage from scattering amplitudes to the actual interacting field algebra of quantum observables (def. below) corresponding to the formal deformation quantization is well understood, given via “Bogoliubov's formula” by the quantum Møller operators (def. below).

Via Bogoliubov's formula every perturbative S-matrix scheme (def. ) induces for every choice of adiabatically switched interaction action functional a notion of perturbative interacting field observables (def. ). These generate an algebra (def. below). By Bogoliubov's formula, in general this algebra depends on the choice of adiabatic switching; which however is not meant to be part of the physics, but just a mathematical device for grasping global field structures locally.

But this spurious dependence goes away (prop. below) when restricting attention to observables whose spacetime support is inside a compact causally closed subsets $\mathcal{O}$ of spacetime (def. below). This is a sensible condition for an observable in physics, where any realistic experiment nessecarily probes only a compact subset of spacetime, see also remark .

The resulting system (a “co-presheaf”) of well-defined perturbative interacting field algebras of observables (def. below)

\mathcal{O} \mapsto IntObs(E_{\text{BV-BRST}}, \mathbf{L}_{int})(\mathcal{O})

is in fact causally local (prop. below). This fact was presupposed without proof already in Il’in-Slavnov 78; because this is one of two key properties that the Haag-Kastler axioms (Haag-Kastler 64) demand of an intrinsically defined quantum field theory (i.e. defined without necessarily making recourse to the geometric backdrop of Lagrangian field theory). The only other key property demanded by the Haag-Kastler axioms is that the algebras of observables be C*-algebras; this however must be regarded as the axiom encoding non-perturbative quantum field theory and hence is necessarily violated in the present context of perturbative QFT. Since quantum field theory following the full Haag-Kastler axioms is commonly known as AQFT, this perturbative version, with causally local nets of observables but without the C*-algebra-condition on them, has come to be called perturbative AQFT (Dütsch-Fredenhagen 01, Fredenhagen-Rejzner 12).

In this terminology the content of prop. below is that while the input of causal perturbation theory is a gauge fixed Lagrangian field theory, the output is a perturbative algebraic quantum field theory:

\array{ \array{ \text{gauge-fixed} \\ \text{Lagrangian} \\ \text{field theory} } & \overset{ \array{ \text{causal} \\ \text{perturbation theory} \\ } }{\longrightarrow}& \array{ \text{perturbative} \\ \text{algebraic} \\ \text{quantum} \\ \text{field theory} } \\ \underset{ \array{ \text{(Becchi-Rouet-Stora 76,} \\ \text{Batalin-Vilkovisky 80s)} } }{\,} & \underset{ \array{ \text{(Bogoliubov-Shirkov 59,} \\ \text{Epstein-Glaser 73)} } }{\,} & \underset{ \array{ \text{ (Il'in-Slavnov 78, } \\ \text{Brunetti-Fredenhagen 99,} \\ \text{Dütsch-Fredenhagen 01)} } }{\,} }

The independence of the causally local net of localized interacting field algebras of observables $IntObs(E_{\text{BV-BRST}}, \mathbf{L}_{int} )(\mathcal{O})$ from the choice of adiabatic switching implies a well-defined spacetime-global algebra of observables by forming the inductive limit

IntObs(E_{\text{BV-BRST}}, \mathbf{L}_{int}) \;\coloneqq\; \underset{\underset{\mathcal{O}}{\longrightarrow}}{\lim} \left( {\, \atop \,} IntObs(E_{\text{BV-BRST}}, \mathbf{L}_{int})(\mathcal{O}) {\, \atop \,} \right) \,.

This is also called the algebraic adiabatic limit, defining the algebras of observables of perturbative QFT “in the infrared”. The only remaining step in the construction of a perturbative QFT that remains is then to find an interacting vacuum state

\left\langle - \right\rangle_{int} \;\colon\; IntObs(E_{\text{BV-BRST}}, \mathbf{L}_{int}) \longrightarrow \mathbb{C}[ [ \hbar, g] ]

on the global interacting field algebra $Obs_{\mathbf{L}_{int}}$ . This is related to the actual adiabatic limit, and it is by and large an open problem, see remark below.

In conclusion so far, the algebraic adiabatic limit yields, starting with a BV-BRST gauge fixed free field vacuum, the perturbative construction of interacting field algebras of observables (def. ) and their organization in increasing powers of $\hbar$ and $g$ (loop order, prop. ) via the Feynman perturbation series (example , example ).

But this interacting field algebra of observables still involves all the auxiliary fields of the BV-BRST gauge fixed free field vacuum (as in example for QED), while the actual physical gauge invariant on-shell observables should be (just) the cochain cohomology of the BV-BRST differential on this enlarged space of observables. Hence for the construction of perturbative QFT to conclude, it remains to pass the BV-BRST differential of the free field Wick algebra of observables to a differential on the interacting field algebra, such that its cochain cohomology is well defined.

Since the time-ordered products away from coinciding interaction points are uniquely fixed (prop. below), one finds that also this interacting quantum BV-differential is uniquely fixed, on regular polynomial observables, by conjugation with the quantum Møller operators (def. ). The formula that characterizes it there is called the quantum master equation or equivalently the quantum master Ward identity (prop. below).

In its incarnation as the master Ward identity, this expresses the difference between the shell of the free classical field theory and that of the interacting quantum field theory, thus generalizing the Schwinger-Dyson equation to interacting field theory (example below). Applied to Noether's theorem it expresses the possible failure of conserved currents associated with infinitesimal symmetries of the Lagrangian to still be conserved in the interacting perturbative QFT (example below).

As one extends the time-ordered products to coinciding interaction points in ("re"-)normalization of the perturbative QFT (def. below), the quantum master equation/master Ward identity becomes a renormalization condition (prop. below). If this condition fails one says that the interacting perturbative QFT has a quantum anomaly, specifically a gauge anomaly if the Ward identity of an infinitesimal gauge symmetry is violated.

These issues of "(re)-"normalization we discuss in detail in the next chapter.

$\,$

Free field vacua

In considering perturbative QFT, we are considering perturbation theory in formal deformation parameters around a fixed free Lagrangian quantum field theory in a chosen Hadamard vacuum state.

For convenient referencing we collect all the structure and notation that goes into this in the following definitions:

Definition

(free relativistic Lagrangian quantum field vacuum)

Let

$\Sigma$ be a spacetime (e.g. Minkowski spacetime);
$(E,\mathbf{L})$ a free Lagrangian field theory (def. ), with field bundle $E \overset{fb}{\to} \Sigma$ ;
$\mathcal{G} \overset{fb}{\to} \Sigma$ a gauge parameter bundle for $(E,\mathbf{L})$ (def. ), with induced BRST-reduced Lagrangian field theory $\left( E \times_\Sigma \mathcal{G}[1], \mathbf{L} - \mathbf{L}_{BRST}\right)$ (example );
$(E_{\text{BV-BRST}}, \mathbf{L}' - \mathbf{L}'_{BRST})$ a gauge fixing (def. ) with graded BV-BRST field bundle $E_{\text{BV-BRST}} = T^\ast_{\Sigma}[-1]\left( E\times_\Sigma \mathcal{G}[1] \times_\Sigma A \times_\Sigma A[-1]\right)$ (remark );
$\Delta_H \in \Gamma'( E_{\text{BV-BRST}} \boxtimes E_{\text{BV-BRST}} )$ a Wightman propagator $\Delta_H = \tfrac{i}{2} \Delta + H$ compatible with the causal propagator $\Delta$ which corresponds to the Green hyperbolic Euler-Lagrange equations of motion induced by the gauge-fixed Lagrangian density $\mathbf{L}'$ .

Given this, we write

\left( {\, \atop \,} PolyObs(E_{\text{BV-BRST}})_{mc}[ [\hbar] ] \;,\; \star_H {\, \atop \,} \right)

for the corresponding Wick algebra-structure on formal power series in $\hbar$ (Planck's constant) of microcausal polynomial observables (def. ). This is a star algebra with respect to (coefficient-wise) complex conjugation (prop. ).

Write

(226)

\array{ PolyObs(E_{\text{BV-BRST}})_{mc}[ [\hbar] ] &\overset{\langle - \rangle}{\longrightarrow}& \mathbb{C}[ [\hbar] ] \\ A &\mapsto& A(\Phi = 0) }

for the induced Hadamard vacuum state (prop. ), hence the state whose distributional 2-point function is the chosen Wightman propagator:

\left\langle \mathbf{\Phi}^a(x) \mathbf{\Phi}^b(y)\right\rangle \;=\; \hbar \, \Delta_H^{a b}(x,y) \,.

Given any microcausal polynomial observable $A \in PolyObs(E_{\text{BV-BRST}})_{mc}[ [ \hbar, g, j] ]$ then its value in this state is called its free vacuum expectation value

\left\langle A \right\rangle \;\in\; \mathbb{C}[ [ \hbar, g, j] ] \,.

Write

(227)

\array{ LocObs(E_{\text{BV-BRST}}) &\overset{\phantom{A}:(-):\phantom{A}}{\hookrightarrow}& PolyObs(E_{\text{BV-BRST}})_{mc} \\ A &\mapsto& :A: }

for the inclusion of local observables (def. ) into microcausal polynomial observables (example ), thought of as forming normal-ordered products in the Wick algebra (by def. ).

We denote the Wick algebra-product (the star product $\star_H$ induced by the Wightman propagator $\Delta_H$ according to prop. ) by juxtaposition (def. )

A_1 A_2 \;\coloneqq\; A_1 \star_H A_2 \,.

If an element $A \in PolyObs(E_{\text{BV-BRST}})$ has an inverse with respect to this product, we denote that by $A^{-1}$ :

A^{-1} A = 1 \,.

Finally, for $A \in LocObs(E_{\text{BV-BRST}})$ we write $supp(A) \subset \Sigma$ for its spacetime support (def. ). For $S_1, S_2 \subset \Sigma$ two subsets of spacetime we write

S_1 {\vee\!\!\!\wedge} S_2 \phantom{AAA} \left\{ \array{ \text{"}S_1 \, \text{does not intersect the past of} \, S_2\text{"} \\ \Updownarrow \\ \text{"}S_2 \, \text{does not intersect the future of} \, S_1\text{"} } \right.

for the causal order-relation (def. ) and

S_1 {\gt\!\!\!\!\lt} S_2 \phantom{AAA} \text{for} \phantom{AAA} \array{ S_1 {\vee\!\!\!\wedge} S_2 \\ \text{and} \\ S_2 {\vee\!\!\!\wedge} A_1 }

for spacelike separation.

Being concerned with perturbation theory means mathematically that we consider formal power series in deformation parameters $\hbar$ (“Planck's constant”) and $g$ (“coupling constant”), also in $j$ (“source field”), see also remark . The following collects our notational conventions for these matters:

Definition

(formal power series of observables for perturbative QFT)

Let $(E_{\text{BV-BRST}}, \mathbf{L}', \Delta_H )$ be a relativistic free vacuum according to def. .

Write

LocObs(E_{\text{BV-BRST}})[ [ \hbar, g, j ] ] \;\coloneqq\; \underset{ k_1, k_2, k_3 \in \mathbb{N}}{\prod} LocObs(E_{\text{BV-BRST}})\langle \hbar^{k_1} g^{k_2} j^{k^3}\rangle

for the space of formal power series in three formal variables

$\hbar$ (“Planck's constant”),
$g$ (“coupling constant”)
$j$ (“source field”)

with coefficients in the topological vector spaces of the off-shell polynomial local observables of the free field theory (def. ); similarly for the off-shell microcausal polynomial observables (def. ):

PolyObs(E_{\text{BV-BRST}})_{mc}[ [ \hbar, g, j ] ] \;\coloneqq\; \underset{ k_1, k_2, k_3 \in \mathbb{N}}{\prod} PolyObs(E_{\text{BV-BRST}})_{mc}\langle \hbar^{k_1} g^{k_2} j^{k^3}\rangle \,.

Similary

LocObs(E_{\text{BV-BRST}})[ [ \hbar, g ] ] \,, \phantom{AAA} PolyObs(E_{\text{BV-BRST}})[ [ \hbar, g ] ]

denotes the subspace for which no powers of $j$ appear, etc.

Accordingly

C^\infty_{cp}(\Sigma) \langle g \rangle

denotes the vector space of bump functions on spacetime tensored with the vector space spanned by a single copy of $g$ . The elements

g_{sw} \in C^\infty_{cp}(\Sigma)\langle g \rangle

may be regarded as spacetime-dependent “coupling constants” with compact support, called adiabatically switched couplings.

Similarly then

LocObs(E_{\text{BV-BRST}})[ [ \hbar, g, j ] ]\langle g , j \rangle

is the subspace of those formal power series that are at least linear in $g$ or $j$ (hence those that vanish if one sets $g,j = 0$ ). Hence every element of this space may be written in the form

O = g S_{int} + j A \;\in\; LocObs(E_{\text{BV-BRST}})[ [ \hbar, g, j ] ]\langle g , j \rangle \,,

where the notation is to suggest that we will think of the coefficient of $g$ as an (adiabatically switched) interaction action functional and of the coefficient of $j$ as an external source field (reflected by internal and external vertices, respectively, in Feynman diagrams, see def. below).

In particular for

\mathbf{L}_{int} \;\in\; \Omega^{p+1,0}_\Sigma(E_{\text{BV-BRST}})[ [ \hbar , g] ]

a formal power series in $\hbar$ and $g$ of local Lagrangian densities (def. ), thought of as a local interaction Lagrangians, and if

g_{sw} \;\in\; C^\infty_{cp}(\Sigma) \langle g \rangle

is an adiabatically switched coupling as before, then the transgression (def. ) of the product

g_{sw} \mathbf{L}_{int} \;\in\; \Omega^{p+1,0}_{\Sigma,cp}(E_{\text{BV-BRST}})[ [ \hbar ,g ] ]\langle g \rangle

is such an adiabatically switched interaction

g S_{int} \;=\; \tau_\Sigma( g_{sw} \mathbf{L}_{int} ) \;\in\; LocObs(E_{\text{BV-BRST}})[ [ \hbar, g] ]\langle g \rangle \,.

We also consider the space of off-shell microcausal polynomial observables of the free field theory with formal parameters adjoined

PolyObs(E_{\text{BV-BRST}})_{mc} ((\hbar)) [ [ g , j] ] \,,

which, in its $\hbar$ -dependent, is the space of Laurent series in $\hbar$ , hence the space exhibiting also negative formal powers of $\hbar$ .

$\,$

Perturbative S-Matrices

We introduce now the axioms for perturbative scattering matrices relative to a fixed relativistic free Lagrangian quantum field vacuum (def. below) according to causal perturbation theory (def. below). Since the first of these axioms requires the S-matrix to be a formal sum of multi-linear continuous functionals, it is convenient to impose axioms on these directly: this is the axiomatics for time-ordered products in def. below. That these latter axioms already imply the former is the statement of prop. , prop. below . Its proof requires a close look at the “reverse-time ordered products” for the inverse S-matrix (def. below) and their induced reverse-causal factorization (prop. below).

Definition

(S-matrix axioms – causal perturbation theory)

Let $(E_{\text{BV-BRST}}, \mathbf{L}', \Delta_H )$ be a relativistic free vacuum according to def. .

Then a perturbative S-matrix scheme for perturbative QFT around this free vacuum is a function

\mathcal{S} \;\;\colon\;\; LocObs(E_{\text{BV-BRST}})[ [\hbar , g, j] ]\langle g, j \rangle \overset{\phantom{AAA}}{\longrightarrow} PolyObs(E_{\text{BV-BRST}})_{mc}((\hbar))[ [ g, j ] ]

from local observables to microcausal polynomial observables of the free vacuum theory, with formal parameters adjoined as indicated (def. ), such that the following two conditions “perturbation” and “causal additivity (jointly: ”causal perturbation theory“) hold:

(perturbation)

There exist multi-linear continuous functionals (over $\mathbb{C}[ [\hbar, g, j] ]$ ) of the form

(228) $T_k \;\colon\; \left( {\, \atop \,} LocObs(E_{\text{BV-BRST}})[ [\hbar, g, j] ]\langle g, j \rangle {\, \atop \,} \right)^{\otimes^k_{\mathbb{C}[ [\hbar, g, j] ]}} \longrightarrow PolyObs(E_{\text{BV-BRST}})_{mc}((\hbar))[ [ g, j ] ]$

for all $k \in \mathbb{N}$ , such that:
1. The nullary map is constant on the unit of the Wick algebra
  
  $T_0( g S_{int} + j A) = 1$
2. The unary map is the inclusion of local observables as normal-ordered products (227)
  
  $T_1(g S_{int} + j A) = g :S_{int}: + j :A:$
3. The perturbative S-matrix is the exponential series of these maps in that for all $g S_{int} + j A \in LocObs(E_{\text{BV-BRST}})[ [\hbar , g, j] ]\langle g,j\rangle$
  
  (229) $\begin{aligned} \mathcal{S}( g S_{int} + j A) & = T \left( \exp_{\otimes} \left( \tfrac{ 1 }{i \hbar} \left( g S_{int} + j A \right) \right) \right) \\ & \coloneqq \underoverset{k = 0}{\infty}{\sum} \frac{1}{k!} \left( \frac{1}{i \hbar} \right)^k T_k \left( {\, \atop \,} \underset{k\,\text{arguments}}{\underbrace{ (g S_{int} + jA) , \cdots, (g S_{int} + j A) }} {\, \atop \,} \right) \end{aligned}$
(causal additivity)

For all perturbative local observables $O_0, O_1, O_2 \in LocObs(E_{\text{BV-BRST}})[ [\hbar, g, j] ]$ we have

(230) $\left( {\, \atop \,} supp( O_1 ) {\vee\!\!\!\wedge} supp( O_2 ) {\, \atop \,} \right) \;\; \Rightarrow \;\; \left( {\, \atop \,} \mathcal{S}( O_0 + O_1 + O_2 ) \;\, \mathcal{S}( O_0 + O_1 ) \, \mathcal{S}( O_0 )^{-1} \, \mathcal{S}(O_0 + O_2) {\, \atop \,} \right) \,.$

(The inverse $\mathcal{S}(O)^{-1}$ of $\mathcal{S}(O)$ with respect to the Wick algebra-structure is implied to exist by the axiom “perturbation”, see remark below.)

Def. is due to (Epstein-Glaser 73 (1)), following (Stückelberg 49-53, Bogoliubov-Shirkov 59). That the domain of an S-matrix scheme is indeed the space of local observables was made explicit (in terms of axioms for the time-ordered products, see def. below), in (Brunetti-Fredenhagen 99, section 3, Dütsch-Fredenhagen 04, appendix E, Hollands-Wald 04,around (20)). Review includes (Rejzner 16, around def. 6.7, Dütsch 18, section 3.3).

Remark

(invertibility of the S-matrix)

The mutliplicative inverse $S(-)^{-1}$ of the perturbative S-matrix in def. with respect to the Wick algebra-product indeed exists, so that the list of axioms is indeed well defined: By the axiom “perturbation” this follows with the usual formula for the multiplicative inverse of formal power series that are non-vanishing in degree 0:

If we write

\mathcal{S}(g S_{int} + j A) = 1 + \mathcal{D}(g S_{int} + j A)

then

(231)

\begin{aligned} \left( {\, \atop \,} \mathcal{S}(g S_{int} + j A) {\, \atop \,} \right)^{-1} &= \left( {\, \atop \,} 1 + \mathcal{D}(g S_{int} + j A) {\, \atop \,} \right)^{-1} \\ & = \underoverset{r = 0}{\infty}{\sum} \left( {\, \atop \,} -\mathcal{D}(g S_{int} + j A) {\, \atop \,} \right)^r \end{aligned}

where the sum does exist in $PolyObs(E_{\text{BV-BRST}})((\hbar))[ [[ g,j ] ]$ , because (by the axiom “perturbation”) $\mathcal{D}(g S_{int} + j A)$ has vanishing coefficient in zeroth order in the formal parameters $g$ and $j$ , so that only a finite sub-sum of the formal infinite sum contributes in each order in $g$ and $j$ .

This expression for the inverse of S-matrix may usefully be re-organized in terms of “rever-time ordered products” (def. below), see prop. below.

Notice that $\mathcal{S}(-g S_{int} - j A )$ is instead the inverse with respect to the time-ordered products (228) in that

T( \mathcal{S}(-g S_{int} - j A ) \,,\, \mathcal{S}(g S_{int} + j A) ) \;=\; 1 \;=\; T( \mathcal{S}(g S_{int} + j A ) \,,\, \mathcal{S}(-g S_{in} - j A ) ) \,.

(Since the time-ordered product is, by definition, symmetric in its arguments, the usual formula for the multiplicative inverse of an exponential series applies).

Remark

(adjoining further deformation parameters)

The definition of S-matrix schemes in def. has immediate variants where arbitrary countable sets $\{g_n\}$ and $\{j_m\}$ of formal deformation parameters are considered, instead of just a single coupling constant $g$ and a single source field $j$ . The more such constants are considered, the “more perturbative” the theory becomes and the stronger the implications.

Given a perturbative S-matrix scheme (def. ) it immediately induces a corresponding concept of observables:

Definition

(generating function scheme for interacting field observables)

Let $(E_{\text{BV-BRST}}, \mathbf{L}', \Delta_H )$ be a relativistic free vacuum according to def. , let $\mathcal{S}$ be a corresponding S-matrix scheme according to def. .

The corresponding generating function scheme (for interacting field observables, def. below) is the functional

\mathcal{Z}_{(-)}(-) \;\colon\; LocObs(E_{\text{BV-BRST}})[ [\hbar, g] ]\langle g \rangle \;\times\; LocObs(E_{\text{BV-BRST}})[ [\hbar, j] ]\langle j \rangle \longrightarrow PolyObs(E_{\text{BV-BRST}})_{mc}((\hbar))[ [g , j] ]

given by

(232)

\mathcal{Z}_{g S_{int}}(j A) \;\coloneqq\; \mathcal{S}(g S_{int})^{-1} \mathcal{S}( g S_{int} + j A ) \,.

Proposition

(causal additivity in terms of generating functions)

In terms of the generating functions $\mathcal{Z}$ (def. ) the axiom “causal additivity” on the S-matrix scheme $\mathcal{S}$ (def. ) is equivalent to:

(causal additivity in terms of $\mathcal{Z}$ )

For all local observables $O_0, O_1, O_2 \in LocObs(E_{\text{BV-BRST}})[ [\hbar, g, j] ]\otimes\mathbb{C}\langle g,j\rangle$ we have

(233) $\begin{aligned} \left( {\, \atop \,} supp(O_1) {\vee\!\!\!\wedge} supp(O_2) {\, \atop \,} \right) & \;\; \Rightarrow \;\; \left( {\, \atop \,} \mathcal{Z}_{O_0}( O_1 ) \, \mathcal{Z}_{O_0}( O_2) = \mathcal{Z}_{ O_0 }( O_1 + O_2 ) {\, \atop \,} \right) \\ & \;\; \Leftrightarrow \;\; \left( {\, \atop \,} \mathcal{Z}_{ O_0 + O_1 }( O_2 ) = \mathcal{Z}_{ O_0 }( O_2 ) {\, \atop \,} \right) \end{aligned} \,.$

(Whence “additivity”.)

Proof

This follows by elementary manipulations:

Multiplying both sides of (230) by $\mathcal{S}(O_0)^{-1}$ yields

\underset{ \mathcal{Z}_{ O_0 }( O_1 + O_2 ) }{ \underbrace{ \mathcal{S}( O_0 )^{-1} \mathcal{S}( O_0 + O_1 + O_2 ) } } \;=\; \underset{ \mathcal{Z}_{ O_0 }( O_1 ) }{ \underbrace{ \mathcal{S}( O_0 )^{-1} \mathcal{S}( O_0 + O_1 ) } } \underset{ \mathcal{Z}_{ O_0 }( O_2 ) }{ \underbrace{ \mathcal{S}( O_0 )^{-1} \mathcal{S}( O_0 + O_2 ) } }

This is the first line of (233).

Multiplying both sides of (230) by $\mathcal{S}( O_0 + O_1 )^{-1}$ yields

\underset{ = \mathcal{Z}_{ O_0 + O_1 }( O_2 ) }{ \underbrace{ \mathcal{S}( O_0 + O_1 )^{-1} \mathcal{S}( O_0 + O_1 + O_2 ) } } \;=\; \underset{ = \mathcal{Z}_{ O_0 }( O_2 ) }{ \underbrace{ \mathcal{S}( O_0 )^{-1} \mathcal{S}( O_0 + O_2 ) } } \,.

This is the second line of (233).

Definition

(interacting field observables – Bogoliubov's formula)

Let $(E_{\text{BV-BRST}}, \mathbf{L}', \Delta_H )$ be a relativistic free vacuum according to def. , let $\mathcal{S}$ be a corresponding S-matrix scheme according to def. , and let $g S_{int} \in LocObs(E_{\text{BV-BRST}})[ [ \hbar, g ] ]\langle g \rangle$ be a local observable regarded as an adiabatically switched interaction-functional.

Then for $A \in LocObs(E_{\text{BV-BRST}})[ [\hbar , g] ]$ a local observable of the free field theory, we say that the corresponding local interacting field observable

A_{int} \in PolyObs(E_{\text{BV-BRST}})_{mc}[ [\hbar, g] ]

is the coefficient of $j^1$ in the generating function (232):

(234)

\begin{aligned} A_{int} &\coloneqq i \hbar \frac{d}{d j} \left( {\, \atop \,} \mathcal{Z}_{ g S_{int} }( j A ) {\, \atop \,} \right)_{\vert_{j = 0}} \\ & \coloneqq i \hbar \frac{d}{d j} \left( {\, \atop \,} \mathcal{S}(g S_{int})^{-1} \, \mathcal{S}( g S_{int} + j A ) {\, \atop \,} \right)_{\vert_{j = 0}} \\ & = \mathcal{S}(g S_{int})^{-1} T\left( \mathcal{S}(g S_{int}), A \right) \,. \end{aligned}

This expression is called Bogoliubov's formula, due to (Bogoliubov-Shirkov 59).

One thinks of $A_{int}$ as the deformation of the local observable $A$ as the interaction $S_{int}$ is turned on; and speaks of an element of the interacting field algebra of observables. Their value (“expectation value”) in the given free Hadamard vacuum state $\langle -\rangle$ (def. ) is a formal power series in Planck's constant $\hbar$ and in the coupling constant $g$ , with coefficients in the complex numbers

\left\langle A_{int} \right\rangle \;\in\; \mathbb{C}[ [\hbar, g] ]

which express the probability amplitudes that reflect the predictions of the perturbative QFT, which may be compared to experiment.

(Epstein-Glaser 73, around (74)); review includes (Dütsch-Fredenhagen 00, around (17), Dütsch 18, around (3.212)).

Remark

(interacting field observables are formal deformation quantization)

The interacting field observables in def. are indeed formal power series in the formal parameter $\hbar$ (Planck's constant), as opposed to being more general Laurent series, hence they involve no negative powers of $\hbar$ (Dütsch-Fredenhagen 00, prop. 2 (ii), Hawkins-Rejzner 16, cor. 5.2). This is not immediate, since by def. the S-matrix that they are defined from does involve negative powers of $\hbar$ .

It follows in particular that the interacting field observables have a classical limit $\hbar \to 0$ , which is not the case for the S-matrix itself (due to it involving negative powers of $\hbar$ ). Indeed the interacting field observables constitute a formal deformation quantization of the covariant phase space of the interacting field theory (prop. below) and are thus the more fundamental concept.

As the name suggests, the S-matrices in def. serve to express scattering amplitudes (example below). But by remark the more fundamental concept is that of the interacting field observables. Their perspective reveals that consistent interpretation of scattering amplitudes requires the following condition on the relation between the vacuum state and the interaction term:

Definition

(vacuum stability)

Let $(E_{\text{BV-BRST}}, \mathbf{L}', \Delta_H )$ be a relativistic free vacuum according to def. , let $\mathcal{S}$ be a corresponding S-matrix scheme according to def. , and let $g S_{int} \in LocObs(E_{\text{BV-BRST}})[ [ \hbar, g] ]\langle g \rangle$ be a local observable, regarded as an adiabatically switched interaction action functional.

We say that the given Hadamard vacuum state (prop. )

\langle - \rangle \;\colon\; PolyObs(E_{\text{BV-BRST}})_{mc}[ [ \hbar , g, j ] ] \longrightarrow \mathbb{C}[ [ \hbar, g, j ] ]

is stable with respect to the interaction $S_{int}$ , if for all elements of the Wick algebra

A \in PolyObs(E_{\text{BV-BRST}})_{mc}[ [ \hbar, g] ]

we have

\left\langle A \mathcal{S}(g S_{int}) \right\rangle \;=\; \left\langle \mathcal{S}(g S_{int}) \right\rangle \, \left\langle A \right\rangle \phantom{AA} \text{and} \phantom{AA} \left\langle \mathcal{S}(g S_{int})^{-1} A \right\rangle \;=\; \frac{1} { \left\langle \mathcal{S}(g S_{int}) \right\rangle } \left\langle A \right\rangle

Example

(time-ordered product of interacting field observables)

Consider two local observables

A_1, A_2 \;\in\; LocObs(E_{\text{BV-BRST}})[ [ \hbar , g] ]

with causally ordered spacetime support

supp(A_1) {\vee\!\!\!\!\wedge} supp(A_2)

Then causal additivity according to prop. implies that the Wick algebra-product of the corresponding interacting field observables $(A_1)_{int}, (A_2)_{int} \in PolyObs(E_{\text{BV-BRST}})[ [ \hbar, g ] ]$ (def. ) is

\begin{aligned} (A_1)_{int} (A_2)_{int} & = \left( \frac{\partial}{\partial j} \mathcal{Z}(j A_1 ) \right)_{\vert j = 0} \left( \frac{\partial}{\partial j} \mathcal{Z}( j A_2 ) \right)_{\vert j = 0} \\ & = \frac{\partial^2}{\partial j_1 \partial j_2} \left( {\, \atop \,} \mathcal{Z}( j_1 A_1 ) \mathcal{Z}( j_2 A_2 ) {\, \atop \,} \right)_{ \left\vert { {j_1 = 0}, \atop {j_2 = 0} } \right. } \\ & = \frac{\partial^2}{\partial j_1 \partial j_2} \left( {\, \atop \,} \mathcal{Z}( j_1 A_1 + j_2 A_2 ) {\, \atop \,} \right)_{ \left\vert { {j_1 = 0}, \atop {j_2 = 0} } \right. } \end{aligned}

Here the last line makes sense if one extends the axioms on the S-matrix in prop. from formal power series in $\hbar, g, j$ to formal power series in $\hbar, g, j_1, j_2, \cdots$ (remark ). Hence in this generalization, the generating functions $\mathcal{Z}$ are not just generating functions for interacting field observables themselves, but in fact for time-ordered products of interacting field observables.

An important special case of time-ordered products of interacting field observables as in example is the following special case of scattering amplitudes, which is the example that gives the scattering matrix in def. its name:

Example

(scattering amplitudes as vacuum expectation values of interacting field observables)

Consider local observables

\array{ A_{in,1}, \cdots, A_{in , n_{in}}, \\ A_{out,1}, \cdots, A_{out, n_{out}} } \;\;\in\;\; LocObs(E_{\text{BV-BRST}})[ [ \hbar, g] ]

whose spacetime support satisfies the following causal ordering:

A_{out, i_{out} } {\gt\!\!\!\!\lt} A_{out, j_{out}} \phantom{AAA} A_{out, i_{out} } {\vee\!\!\!\wedge} S_{int} {\vee\!\!\!\wedge} A_{in, i_{in}} \phantom{AAA} A_{in, i_{in} } {\gt\!\!\!\!\lt} A_{in, j_{in}}

for all $1 \leq i_{out} \lt j_{out} \leq n_{out}$ and $1 \leq i_{in} \lt j_{in} \leq n_{in}$ .

Then the vacuum expectation value of the Wick algebra-product of the corresponding interacting field observables (def. ) is

\begin{aligned} & \left\langle {\, \atop \,} (A_{out, 1})_{int} \cdots (A_{out,n_{out}})_{int} \, (A_{in, 1})_{int} \cdots (A_{in,n_{in}})_{int} {\, \atop \,} \right\rangle \\ & = \left\langle {\, \atop \,} A_{out,1} \cdots A_{out,n_{out}} \right| \; \mathcal{S}(g S_{int}) \; \left| A_{in,1} \cdots A_{in, n_{in}} {\, \atop \,} \right\rangle \\ & \coloneqq \frac{1}{ \left\langle \mathcal{S}(g S_{int}) \right\rangle } \left\langle {\, \atop \,} A_{out,1} \cdots A_{out,n_{out}} \; \mathcal{S}(g S_{int}) \; A_{in,1} \cdots A_{in, n_{in}} {\, \atop \,} \right\rangle \,. \end{aligned}

These vacuum expectation values are interpreted, in the adiabatic limit where $g_{sw} \to 1$ , as scattering amplitudes (remark below).

Proof

For notational convenience, we spell out the argument for $n_{in} = 1 = n_{out}$ . The general case is directly analogous.

So assuming the causal order (def. )

supp(A_{out}) {\vee\!\!\!\wedge} supp(S_{int}) {\vee\!\!\!\wedge} supp(A_{in})

we compute with causal additivity via prop. as follows:

\begin{aligned} (A_{out})_{int} (A_{in})_{int} & = \frac{d^2 }{\partial j_{out} \partial j_{in}} \left( \mathcal{Z}( j_{out} A_{out} ) \mathcal{Z}( j_{in} A_{in} ) \right)_{\left\vert { { j_{out} = 0 } \atop { j_{in} = 0 } } \right.} \\ & = \frac{\partial^2 }{\partial j_{out} \partial j_{in}} \left( \mathcal{S}(g S_{int})^{-1} \underset{ = \mathcal{S}(j_{out} A_{out}) \mathcal{S}(g S_{int}) }{ \underbrace{ \mathcal{S}(g S_{int} + j_{out} A_{out}) } } \mathcal{S}(g S_{int})^{-1} \underset{ = \mathcal{S}(g S_{int}) \mathcal{S}(j_{in} A_{in}) }{ \underbrace{ \mathcal{S}(g S_{int} + j_{in}A_{in}) } } \right)_{\left\vert { { j_{out} = 0 } \atop { j_{in} = 0 } } \right.} \\ & = \frac{\partial^2 }{\partial j_{out} \partial j_{in}} \left( \mathcal{S}(g S_{int})^{-1} \mathcal{S}(j_{out} A_{out}) \underset{ = \mathcal{S}(g S_{int}) }{ \underbrace{ \mathcal{S}(g S_{int}) \mathcal{S}(g S_{int})^{-1} \mathcal{S}(g S_{int}) } } \mathcal{S}(j_{in} A_{in}) \right)_{\left\vert { { j_{out} = 0 } \atop { j_{in} = 0 } } \right.} \\ & = \mathcal{S}(g S_{int})^{-1} \, \left( {\, \atop \,} A_{out} \mathcal{S}(g S_{int}) A_{in} {\, \atop \,} \right) \,. \end{aligned}

With this the statement follows by the definition of vacuum stability (def. ).

Remark

(computing S-matrices via Feynman perturbation series)

For practical computation of vacuum expectation values of interacting field observables (example ) and hence in particular, via example , of scattering amplitudes, one needs some method for collecting all the contributions to the formal power series in increasing order in $\hbar$ and $g$ .

Such a method is provided by the Feynman perturbation series (example below) and the effective action (def. ), see example below.

$\,$

Conceptual remarks

The simple axioms for S-matrix schemes in causal perturbation theory (def. ) and hence for interacting field observables (def. ) have a wealth of implications and consequences. Before discussing these formally below, we here make a few informal remarks meant to put various relevant concepts into perspective:

Remark

(perturbative QFT and asymptotic expansion of probability amplitudes)

Given a perturbative S-matrix scheme (def. ), then by remark the expectation values of interacting field observables (def. ) are formal power series in the formal parameters $\hbar$ and $g$ (which are interpreted as Planck's constant, and as the coupling constant, respectively):

\left\langle A_{int} \right\rangle \;\in\; \mathbb{C}[ [\hbar, g] ] \,.

This means that there is no guarantee that these series converge for any positive value of $\hbar$ and/or $g$ . In terms of synthetic differential geometry this means that in perturbative QFT the deformation of the classical free field theory by quantum effects (measured by $\hbar$ ) and interactions (meaured by $g$ ) is so very tiny as to actually be infinitesimal: formal power series may be read as functions on the infinitesimal neighbourhood in a space of Lagrangian field theories at the point $\hbar = 0$ , $g = 0$ .

In fact, a simple argument (due to Dyson 52) suggests that in realistic field theories these series never converge for any positive value of $\hbar$ and/or $g$ . Namely convergence for $g$ would imply a positive radius of convergence around $g = 0$ , which would imply convergence also for $-g$ and even for imaginary values of $g$ , which would however correspond to unstable interactions for which no converging field theory is to be expected. (See Helling, p. 4 for the example of phi^4 theory.)

In physical practice one tries to interpret these non-converging formal power series as asymptotic expansions of actual but hypothetical functions in $\hbar, g$ , which reflect the actual but hypothetical non-perturbative quantum field theory that one imagines is being approximated by perturbative QFT methods. An asymptotic expansion of a function is a power series which may not converge, but which has for every $n \in \mathbb{N}$ an estimate for how far the sum of the first $n$ terms in the series may differ from the function being approximated.

For examples such as quantum electrodynamics and quantum chromodynamics, as in the standard model of particle physics, the truncation of these formal power series scattering amplitudes to the first handful of loop orders in $\hbar$ happens to agree with experiment (such as at the LHC collider) to high precision (for QED) or at least decent precision (for QCD), at least away from infrared phenomena (see remark ).

In summary this says that perturbative QFT is an extremely coarse and restrictive approximation to what should be genuine non-perturbative quantum field theory, while at the same time it happens to match certain experimental observations to remarkable degree, albeit only if some ad-hoc truncation of the resulting power series is considered.

This is strong motivation for going beyond perturbative QFT to understand and construct genuine non-perturbative quantum field theory. Unfortunately, this is a wide-open problem, away from toy examples. Not a single interacting field theory in spacetime dimension $\geq 4$ has been non-perturbatively quantized. Already a single aspect of the non-perturbative quantization of Yang-Mills theory (as in QCD) has famously been advertized as one of the Millennium Problems of our age; and speculation about non-perturbative quantum gravity is the subject of much activity.

Now, as the name indicates, the axioms of causal perturbation theory (def. ) do not address non-perturbative aspects of non-perturbative field theory; the convergence or non-convergence of the formal power series that are axiomatized by Bogoliubov's formula (def. ) is not addressed by the theory. The point of the axioms of causal perturbation theory is to give rigorous mathematical meaning to everything else in perturbative QFT.

Remark

(Dyson series and Schrödinger equation in interaction picture)

The axiom “causal additivity” (230) on an S-matrix scheme (def. ) implies immediately this seemingly weaker condition (which turns out to be equivalent, this is prop. below):

(causal factorization)

For all local observables $O_1, O_2 \in LocObs(E_{\text{BV-BRST}})[ [\hbar, h, j] ]\langle g , j\rangle$ we have

$\left( {\, \atop \,} supp(O_1) {\vee\!\!\!\wedge} supp(O_2) {\, \atop \,} \right) \;\; \Rightarrow \;\; \left( {\, \atop \,} \mathcal{S}( O_1 + O_2 ) = \mathcal{S}( O_1 ) \, \mathcal{S}( O_2 ) {\, \atop \,} \right)$

(This is the special case of “causal additivity” for $O_0 = 0$ , using that by the axiom “perturbation” (229) we have $\mathcal{S}(0) = 1$ .)

If we now think of $O_1 = g S_{1}$ and $O_2 = g S_2$ themselves as adiabatically switched interaction action functionals, then this becomes

\left( {\, \atop \,} supp(S_1) {\vee\!\!\!\wedge} supp(S_2) {\, \atop \,} \right) \;\; \Rightarrow \;\; \left( {\, \atop \,} \mathcal{S}( g S_1 + g S_2 ) = \mathcal{S}( g S_1) \, \mathcal{S}( g sS_2) {\, \atop \,} \right)

This exhibits the S-matrix-scheme as a “causally ordered exponential” or “Dyson series” of the interaction, hence as a refinement to relativistic field theory of what in quantum mechanics is the “integral version of the Schrödinger equation in the interaction picture” (see this equation at S-matrix; see also Scharf 95, second half of 0.3).

The relevance of manifest causal additivity of the S-matrix, over just causal factorization (even though both conditions happen to be equivalent, see prop. below), is that it directly implies that the induced interacting field algebra of observables (def. ) forms a causally local net (prop. below).

Remark

(path integral-intuition)

In informal discussion of perturbative QFT going back to informal ideas of Schwinger-Tomonaga-Feynman-Dyson, the perturbative S-matrix is thought of in terms of a would-be path integral, symbolically written

\mathcal{S}\left( g S_{int} + j A \right) \;\overset{\text{not really!}}{=}\; \!\!\!\!\! \underset{\Phi \in \Gamma_\Sigma(E_{\text{BV-BRST}})_{asm}}{\int} \!\!\!\!\!\! \exp\left( \tfrac{1}{i \hbar} \int_\Sigma \left( g L_{int}(\Phi) + j A(\Phi) \right) \right) \, \exp\left( \tfrac{1}{i \hbar}\int_\Sigma L_{free}(\Phi) \right) D[\Phi] \,.

Here the would-be integration is thought to be over the space of field histories $\Gamma_\Sigma(E_{\text{BV-BRST}})_{asm}$ (the space of sections of the given field bundle, remark ) for field histories which satisfy given asymptotic conditions at $x^0 \to \pm \infty$ ; and as these boundary conditions vary the above is regarded as a would-be integral kernel that defines the required operator in the Wick algebra (e.g. Weinberg 95, around (9.3.10) and (9.4.1)). This is related to the intuitive picture of the Feynman perturbation series (example below) expressing a sum over all possible interactions of virtual particles (remark ).

Beyond toy examples, it is not known how to define the would-be measure $D[\Phi]$ and it is not known how to make sense of this expression as an actual integral.

The analogous path-integral intuition for Bogoliubov's formula for interacting field observables (def. ) symbolically reads

\begin{aligned} A_{int} & \overset{\text{not really!}}{=} \frac{d}{d j} \ln \left( \underset{\Phi \in \Gamma_\Sigma(E)_{asm}}{\int} \!\!\!\! \exp\left( \underset{\Sigma}{\int} g L_{int}(\Phi) + j A(\Phi) \right) \, \exp\left( \underset{\Sigma}{\int} L_{free}(\Phi) \right) D[\Phi] \right) \vert_{j = 0} \end{aligned}

If here we were to regard the expression

\mu(\Phi) \;\overset{\text{not really!}}{\coloneqq}\; \frac{ \exp\left( \underset{\Sigma}{\int} L_{free}(\Phi) \right)\, D[\Phi] } { \underset{\Phi \in \Gamma_\Sigma(E_{\text{BV-BRST}})_{asm}}{\int} \!\!\!\! \exp\left( \underset{\Sigma}{\int} L_{free}(\Phi) \right)\, D[\Phi] }

as a would-be Gaussian measure on the space of field histories, normalized to a would-be probability measure, then this formula would express interacting field observables as ordinary expectation values

A_{int} \overset{\text{not really!}}{=} \!\!\! \underset{\Phi \in \Gamma_\Sigma(E_{\text{BV-BRST}})_{asm}}{\int} \!\!\!\!\!\! A(\Phi) \,\mu(\Phi) \,.

As before, beyond toy examples it is not known how to make sense of this as an actual integration.

But we may think of the axioms for the S-matrix in causal perturbation theory (def. ) as rigorously defining the path integral, not analytically as an actual integration, but synthetically by axiomatizing the properties of the desired outcome of the would-be integration:

The analogy with a well-defined integral and the usual properties of an exponential vividly suggest that the would-be path integral should obey causal factorization. Instead of trying to make sense of path integration so that this factorization property could then be appealed to as a consequence of general properties of integration and exponentials, the axioms of causal perturbation theory directly prescribe the desired factorization property, without insisting that it derives from an actual integration.

The great success of path integral-intuition in the development of quantum field theory, despite the dearth of actual constructions, indicates that it is not the would-be integration process as such that actually matters in field theory, but only the resulting properties that this suggests the S-matrix should have; which is what causal perturbation theory axiomatizes. Indeed, the simple axioms of causal perturbation theory rigorously imply finite (i.e. ("re"-)normalized) perturbative quantum field theory (see remark ).

\array{ \array{ \text{would-be} \\ \text{path integral} \\ \text{intuition} } & \overset{ \array{ \text{informally} \\ \text{suggests} } }{\longrightarrow} & \array{ \text{causally additive} \\ \text{scattering matrix} } & \overset{ \array{ \text{rigorously} \\ \text{implies} } }{\longrightarrow} & \array{ \text{UV-finite} \\ \text{(i.e. (re-)normalized)} \\ \text{perturbative QFT} } }

Remark

(scattering amplitudes)

Let $(E_{\text{BV-BRST}}, \mathbf{L}', \Delta_H )$ be a relativistic free vacuum according to def. , let $\mathcal{S}$ be a corresponding S-matrix scheme according to def. , and let

S_{int} \in LocObs(E_{\text{BV-BRST}})[ [ \hbar, g ] ]

be a local observable, regarded as an adiabatically switched interaction action functional.

Then for

A_{in}, A_{out} \in PolyObs(E_{\text{BV-BRST}})_{mc}[ [\hbar] ]

two microcausal polynomial observables, with causal ordering

supp(A_{out}) {\vee\!\!\!\wedge} supp(A_{int})

the corresponding scattering amplitude (as in example ) is the value (called “expectation value” when referring to $A^\ast_{out} \, \mathcal{S}(S_{int}) \, A_{in}$ , or “matrix element” when referring to $\mathcal{S}(S_{int})$ , or “transition amplitude” when referring to $\left\langle A_{out} \right\vert$ and $\left\vert A_{in} \right\rangle$ )

\left\langle A_{out} \,\vert\, \mathcal{S}(S_{int}) \,\vert\, A_{in} \right\rangle \;\coloneqq\; \left\langle A^\ast_{out} \, \mathcal{S}(S_{int}) \, A_{in} \right\rangle \;\in\; \mathbb{C}[ [ \hbar, g ] ] \,.

for the Wick algebra-product $A^\ast_{out} \, \mathcal{S}(S_{int})\, A_{in} \in PolyObs(E_{\text{BV-BRST}})[ [\hbar, g ] ]$ in the given Hadamard vacuum state $\langle -\rangle \colon PolyObs(E_{\text{BV-BRST}})[ [\hbar, g] ] \to \mathbb{C}[ [\hbar,g] ]$ .

If here $A_{in}$ and $A_{out}$ are monomials in Wick algebra-products of the field observables $\mathbf{\Phi}^a(x) \in Obs(E_{\text{BV-BRST}})[ [\hbar] ]$ , then this scattering amplitude comes from the integral kernel

\begin{aligned} & \left\langle \mathbf{\Phi}^{a_{out,1}}(x_{out,1}) \cdots \mathbf{\Phi}^{a_{out,s}}(x_{out,s}) \vert \, \mathcal{S}(S_{int}) \, \vert \mathbf{\Phi}^{a_{in,1}}(x_{in,1}) \cdots \mathbf{\Phi}^{a_{in,r}}(x_{in,r}) \right\rangle \\ & \coloneqq \left\langle \left(\mathbf{\Phi}^{a_{out,1}}(x_{out,1})\right)^\ast \cdots \left(\mathbf{\Phi}^{a_{out,s}}(x_{out,s})\right)^\ast \;\mathcal{S}(S_{int})\; \mathbf{\Phi}^{a_{in,1}}(x_{in,1}) \cdots \mathbf{\Phi}^{a_{in,r}}(x_{in,r}) \right\rangle \end{aligned}

or similarly, under Fourier transform of distributions,

(235)

\begin{aligned} & \left\langle \widehat{\mathbf{\Phi}}^{a_{out,1}}(k_{out,1}) \cdots \widehat{\mathbf{\Phi}}^{a_{out,s}}(k_{out,s}) \vert \, \mathcal{S}(S_{int}) \, \vert \widehat{\mathbf{\Phi}}^{a_{in,1}}(k_{in,1}) \cdots \widehat{\mathbf{\Phi}}^{a_{in,r}}(k_{in,r}) \right\rangle \\ & \coloneqq \left\langle \left(\widehat{\mathbf{\Phi}}^{a_{out,1}}(k_{out,1})\right)^\ast \cdots \left(\widehat{\mathbf{\Phi}}^{a_{out,s}}(k_{out,s})\right)^\ast \;\mathcal{S}(S_{int})\; \widehat{\mathbf{\Phi}}^{a_{in,1}}(k_{in,1}) \cdots \widehat{\mathbf{\Phi}}^{a_{in,r}}(k_{in,r}) \right\rangle \end{aligned} \,.

These are interpreted as the (distributional) probability amplitudes for plane waves of field species $a_{in,\cdot}$ with wave vector $k_{in,\cdot}$ to come in from the far past, ineract with each other via $S_{int}$ , and emerge in the far future as plane waves of field species $a_{out,\cdot}$ with wave vectors $k_{out,\cdot}$ .

Or rather:

Remark

(adiabatic limit, infrared divergences and interacting vacuum)

Since a local observable $S_{int} \in LocObs(E_{\text{BV-BRST}})[ [\hbar, g, j] ]$ by definition has compact spacetime support, the scattering amplitudes in remark describe scattering processes for interactions that vanish (are “adiabatically switched off”) outside a compact subset of spacetime. This constraint is crucial for causal perturbation theory to work.

There are several aspects to this:

(adiabatic limit) On the one hand, real physical interactions $\mathbf{L}_{int}$ (say the electron-photon interaction) are not really supposed to vanish outside a compact region of spacetime. In order to reflect this mathematically, one may consider a sequence of adiabatic switchings $g_{sw} \in C^\infty_{cp}(\Sigma)\langle g \rangle$ (each of compact support) whose limit is the constant function $g \in C^\infty(\Sigma)\langle g\rangle$ (the actual coupling constant), then consider the corresponding sequence of interaction action functionals $S_{int,sw} \coloneqq \tau_\Sigma( g_{sw} \mathbf{L}_{int} )$ and finally consider:
1. as the true scattering amplitude the corresponding limit
  
  $\left\langle A_{out} \vert \mathcal{S}(S_{int}) \vert A_{int} \right\rangle \;\coloneqq\; \underset{g_{sw} \to 1}{\lim} \left\langle A_{out} \vert \mathcal{S}(S_{int,sw}) \vert A_{int} \right\rangle$
  
  of adiabatically switched scattering amplitudes (remark ) – if it exists. This is called the strong adiabatic limit.
2. as the true n-point functions the corresponding limit
  
  $\begin{aligned} & \left\langle \mathbf{\Phi}^{a_1}_{int}(x_1) \mathbf{\Phi}^{a_2}_{int}(x_2) \cdots \mathbf{\Phi}^{a_{n-1}}_{int}(x_{n-1}) \mathbf{\Phi}^{a_n}_{int,sw}(x_n) \right\rangle \\ & = \underset{\underset{g_{sw} \to 1}{\longrightarrow}}{\lim} \left\langle \mathbf{\Phi}^{a_1}_{int,sw}(x_1) \mathbf{\Phi}^{a_2}_{int,sw}(x_2) \cdots \mathbf{\Phi}^{a_{n-1}}_{int,sw}(x_{n-1}) \mathbf{\Phi}^{a_n}_{int,sw}(x_n) \right\rangle \end{aligned}$
  
  of tempered distributional expectation values of products of interacting field observables (def. ) – if it exists. (Similarly for time-ordered products.) This is called the weak adiabatic limit.
Beware that the left hand sides here are symbolic: Even if the limit exists in expectation values, in general there is no actual observable whose expectation value is that limit.

The strong and weak adiabatic limits have been shown to exist if all fields are massive (Epstein-Glaser 73). The weak adiabatic limit has been shown to exists for quantum electrodynamics and for mass-less phi^4 theory (Blanchard-Seneor 75) and for larger classes of field theories in (Duch 17, p. 113, 114).

If these limits do not exist, one says that the perturbative QFT has an infrared divergence.
(algebraic adiabatic limit) On the other hand, it is equally unrealistic that an actual experiment detects phenomena outside a given compact subset of spacetime. Realistic scattering experiments (such as the LHC) do not really prepare or measure plane waves filling all of spacetime as described by the scattering amplitudes (235). Any observable that is realistically measurable must have compact spacetime support. We see below in prop. that such interacting field observables with compact spacetime support may be computed without taking the adiabatic limit: It is sufficient to use any adiabatic switching which is constant on the support of the observable.

This way one obtains for each causally closed subset $\mathcal{O}$ of spacetime an algebra of observables $\mathcal{A}_{int}(\mathcal{O})$ whose support is in $\mathcal{O}$ , and for each inclusion of subsets a corresponding inclusion of algebras of observables (prop. below). Of this system of observables one may form the category-theoretic inductive limit to obtain a single global algebra of observables.

$\mathcal{A}_{int} \;\coloneqq\; \underset{\underset{\mathcal{O}}{\longrightarrow}}{\lim} \mathcal{A}_{int}(\mathcal{O})$

This always exists. It is called the algebraic adiabatic limit (going back to Brunetti-Fredenhagen 00, section 8).

For quantum electrodynamics the algebraic adiabatic limit was worked out in (Dütsch-Fredenhagen 98, reviewed in Dütsch 18, 5,3).
(interacting vacuum) While, via the above algebraic adiabatic limit, causal perturbation theory yields the correct interacting field algebra of quantum observables independent of choices of adiabatic switching, a theory of quantum probability requires, on top of the algebra of observables, also a state

$\langle - \rangle_{int} \;\colon\; \mathcal{A}_{int} \longrightarrow \mathbb{C}[ [\hbar] ]$

Just as the interacting field algebra of observables $\mathcal{A}_{int}$ is a deformation of the free field algebra of observables (Wick algebra), there ought to be a corresponding deformation of the free Hadamard vacuum state $\langle- \rangle$ into an “interacting vacuum state” $\langle - \rangle_{int}$ .

Sometimes the weak adiabatic limit serves to define the interacting vacuum (see Duch 17, p. 113-114).

A stark example of these infrared issues is the phenomenon of confinement of quarks to hadron bound states (notably to protons and neutrons) at large wavelengths. This is paramount in observation and reproduced in numerical lattice gauge theory simulation, but is invisible to perturbative quantum chromodynamics in its free field vacuum state, due to infrared divergences. It is expected that this should be rectified by the proper interacting vacuum of QCD (Rafelski 90, pages 12-16), which is possibly a “theta-vacuum” exhibiting superposition of QCD instantons (Schäfer-Shuryak 98, section III.D). This remains open, closely related to the Millennium Problem of quantization of Yang-Mills theory.

In contrast to the above subtleties about the infrared divergences, any would-be UV-divergences in perturbative QFT are dealt with by causal perturbation theory:

Remark

(the traditional error leading to UV-divergences)

Naively it might seem that (say over Minkowski spacetime, for simplicity) examples of time-ordered products according to def. might simply be obtained by multiplying Wick algebra-products with step functions $\Theta$ of the time coordinates, hence to write, in the notation as generalized functions (remark ):

T(x_1, x_2) \overset{\text{no!}}{=} \Theta(x_1^0 - x_2^0) \, T(x_1) \, T(x_2) + \Theta(x_2^0 - x_1^0) \, T(x_2) \, T(x_1)

and analogously for time-ordered products of more arguments (for instance Weinberg 95, p. 143, between (3.5.9) and (3.5.10)).

This however is simply a mathematical error (as amplified in Scharf 95, below (3.2.4), below (3.2.44) and in fig. 3):

Both $T$ as well as $\Theta$ are distributions and their product of distributions is in general not defined, as Hörmander's criterion (prop. ), which is exactly what guarantees absence of UV-divergences (remark ), may be violated. The notorious ultraviolet divergences which plagued (Feynman 85) the original conception of perturbative QFT due to Schwinger-Tomonaga-Feynman-Dyson are the signature of this ill-defined product (see remark ).

On the other hand, when both distributions are restricted to the complement of the diagonal (i.e. restricted away from coinciding points $x_1 = x_2$ ), then the step function becomes a non-singular distribution so that the above expression happens to be well defined and does solve the axioms for time-ordered products.

Hence what needs to be done to properly define the time-ordered product is to choose an extension of distributions of the above product expression back from the complement of the diagonal to the whole space of tuples of points. Any such extension will produce time-ordered products.

There are in general several different such extensions. This freedom of choice is the freedom of "re-"normalization; or equivalently, by the main theorem of perturbative renormalization theory (theorem below), this is the freedom of choosing “counterterms” (remark below) for the local interactions. This we discuss below and in more detail in the next chapter.

Remark

(absence of ultraviolet divergences and re-normalization)

The simple axioms of causal perturbation theory (def. ) do fully capture perturbative quantum field theory “in the ultraviolet”: A solution to these axioms induces, by definition, well-defined perturbative scattering amplitudes (remark ) and well-defined perturbative probability amplitudes of interacting field observables (def. ) induced by local action functionals (describing point-interactions such as the electron-photon interaction). By the main theorem of perturbative renormalization (theorem ) such solutions exist. This means that, while these are necessarily formal power series in $\hbar$ and $g$ (remark ), all the coefficients of these formal power series (“loop order contributions”) are well defined.

This is in contrast to the original informal conception of perturbative QFT due to Schwinger-Tomonaga-Feynman-Dyson, which in a first stage produced ill-defined diverging expressions for the coefficients (due to the mathematical error discussed in remark below), which were then “re-normalized” to finite values, by further informal arguments.

Here in causal perturbation theory no divergences in the coefficients of the formal power series are considered in the first place, all coefficients are well-defined, hence “finite”. In this sense causal perturbation theory is about “finite” perturbative QFT, where instead of “re-normalization” of ill-defined expressions one just encounters “normalization” (prominently highlighted in Scharf 95, see title, introduction, and section 4.3), namely compatible choices of these finite values. The actual “re-normalization” in the sense of “change of normalization” is expressed by the Stückelberg-Petermann renormalization group.

This refers to those divergences that are known as UV-divergences, namely short-distance effects, which are mathematically reflected in the fact that the perturbative S-matrix scheme (def. ) is defined on local observables, which, by their very locality, encode point-interactions. See also remark on infrared divergences.

Remark

(virtual particles, worldline formalism and perturbative string theory)

It is suggestive to think of the edges in the Feynman diagrams (def. ) as worldlines of “virtual particles” and of the vertices as the points where they collide and transmute. (Care must be exercised not to confuse this with concepts of real particles.) With this interpretation prop. may be read as saying that the scattering amplitude for given external source fields (remark ) is the superposition of the Feynman amplitudes of all possible ways that these may interact; which is closely related to the intuition for the path integral (remark ).

This intuition is made precise by the worldline formalism of perturbative quantum field theory (Strassler 92). This is the perspective on perturbative QFT which directly relates perturbative QFT to perturbative string theory (Schmidt-Schubert 94). In fact the worldline formalism for perturbative QFT was originally found by taking thre point-particle limit of string scattering amplitudes (Bern-Kosower 91, Bern-Kosower 92).

Remark

(renormalization scheme)

Beware the terminology in def. : A single S-matrix is one single observable

\mathcal{S}(S_{int}) \;\in\; PolyObs(E_{\text{BV-BRST}})_{mc}((\hbar))[ [g,j] ]

for a fixed (adiabatically switched local) interaction $S_{int}$ , reflecting the scattering amplitudes (remark ) with respect to that particular interaction. Hence the function

\mathcal{S} \;\colon\; LocObs(E_{\text{BV-BRST}})[ [\hbar, g,j] ]\langle g, j \rangle \longrightarrow PolyObs(E_{\text{BV-BRST}})((\hbar))[ [g,j] ]

axiomatized in def. is really a whole scheme for constructing compatible S-matrices for all possible (adiabatically switched, local) interactions at once.

Since the usual proof of the construction of such schemes of S-matrices involves ("re"-)normalization, the function $\mathcal{S}$ axiomatized by def. may also be referred to as a ("re"-)normalization scheme.

This perspective on $\mathcal{S}$ as a renormalization scheme is amplified by the main theorem of perturbative renormalization (theorem ) wich states that the space of choices for $\mathcal{S}$ is a torsor over the Stückelberg-Petermann renormalization group.

Remark

(quantum anomalies)

The axioms for the S-matrix in def. (and similarly that for the time-ordered products below in def. ) are sufficient to imply a causally local net of perturbative interacting field algebras of quantum observables (prop. below), and thus its algebraic adiabatic limit (remark ).

It does not guarantee, however, that the BV-BRST differential passes to those algebras of quantum observables, hence it does not guarantee that the infinitesimal symmetries of the Lagrangian are respected by the quantization process (there may be “quantum anomalies”). The extra condition that does ensure this is the quantum master Ward identity or quantum master equation. This we discuss elsewhere.

Apart from gauge symmetries one also wants to require that rigid symmetries are preserved by the S-matrix, notably Poincare group-symmetry for scattering on Minkowski spacetime.

$\,$

Interacting field observables

We now discuss how the perturbative interacting field observables which are induced from an S-matrix enjoy good properties expected of any abstractly defined perturbative algebraic quantum field theory.

$\,$

Definition

(interacting field algebra of observables – quantum Møller operator)

Let $(E_{\text{BV-BRST}}, \mathbf{L}', \Delta_H )$ be a relativistic free vacuum according to def. , let $\mathcal{S}$ be a corresponding S-matrix scheme according to def. , and let $g S_{int} \in LocObs(E_{\text{BV-BRST}})[ [ \hbar, g ] ]$ be a local observable regarded as an adiabatically switched interaction-functional.

We write

LocIntObs_{\mathcal{S}}(E_{\text{BV-BRST}}, g S_{int}) \;\coloneqq\; \left\{ {\, \atop \,} A_{int} \;\vert\; A \in LocObs(E_{BV-BRST})[ [ \hbar, g ] ] {\, \atop \,} \right\} \hookrightarrow PolyObs(E_{\text{BV-BRST}})[ [ \hbar, g] ]

for the subspace of interacting field observables $A_{int}$ (def. ) corresponding to local observables $A$ , the local interacting field observables.

Furthermore we write

\array{ LocObs(E_{\text{BV-BRST}})[ [ \hbar , g] ] & \underoverset{\simeq}{\phantom{A}\mathcal{R}^{-1}\phantom{A}}{\longrightarrow} & IntLocObs(E_{\text{BV-BRST}}, g S_{int})[ [ \hbar , g ] ] \\ A &\mapsto& A_{int} \coloneqq \mathcal{S}(g S_{int})^{-1} T( \mathcal{S}(g S_{int}), A ) }

for the factorization of the function $A \mapsto A_{int}$ through its image, which, by remark , is a linear isomorphism with inverse

\array{ IntLocObs(E_{\text{BV-BRST}}, g S_{int})[ [ \hbar , g ] ] & \underoverset{\simeq}{\phantom{A}\mathcal{R}\phantom{A}}{\longrightarrow} & LocObs(E_{\text{BV-BRST}})[ [ \hbar , g] ] \\ A_{int} &\mapsto& A \coloneqq T\left( \mathcal{S}(-g S_{int}) , \left( \mathcal{S}(g S_{int}) A_{int} \right) \right) }

This may be called the quantum Møller operator (Hawkins-Rejzner 16, (33)).

Finally we write

\begin{aligned} IntObs(E_{\text{BV-BRST}}, S_{int}) & \coloneqq \left\langle {\, \atop \,} IntLocObs(E_{\text{BV-BRST}})[ [ \hbar, g] ] {\, \atop \,} \right\rangle \\ & \phantom{\coloneqq} \hookrightarrow PolyObs(E_{\text{BV-BRST}})[ [ \hbar, g ] ] \end{aligned}

for the smallest subalgebra of the Wick algebra containing the interacting local observables. This is the perturbative interacting field algebra of observables.

The definition of the interacting field algebra of observables from the data of a scattering matrix (def. ) via Bogoliubov's formula (def. ) is physically well-motivated, but is not immediately recognizable as the result of applying a systematic concept of quantization (such as formal deformation quantization) to the given Lagrangian field theory. The following proposition says that this is nevertheless the case. (The special case of this statement for free field theory is discussed at Wick algebra, see remark ).

Proposition

(interacting field algebra of observables is formal deformation quantization of interacting Lagrangian field theory)

Let $(E_{\text{BV-BRST}}, \mathbf{L}', \Delta_H )$ be a relativistic free vacuum according to def. , and let $g_{sw} \mathbf{L}_{int} \in \Omega^{p+1,0}_{\Sigma,cp}(E_{\text{BV-BRST}})[ [\hbar, g ] ]\langle g\rangle$ be an adiabatically switched interaction Lagrangian density with corresponding action functional $g S_{int} \coloneqq \tau_\Sigma( g_{sw} \mathbf{L}_{int} )$ .

Then, at least on regular polynomial observables, the construction of perturbative interacting field algebras of observables in def. is a formal deformation quantization of the interacting Lagrangian field theory $(E_{\text{BV-BRST}}, \mathbf{L}' + g_{sw} \mathbf{L}_{int})$ .

(Hawkins-Rejzner 16, prop. 5.4, Collini 16)

The following definition collects the system (a co-presheaf) of generating functions for interacting field observables which are localized in spacetime as the spacetime localization region varies:

Definition

(system of spacetime-localized generating functions for interacting field observables)

Let $(E_{\text{BV-BRST}}, \mathbf{L}', \Delta_H )$ be a relativistic free vacuum according to def. , let $\mathcal{S}$ be a corresponding S-matrix scheme according to def. , and let

\mathbf{L}_{int} \;\in\; \Omega^{p+1,0}_\Sigma(E_{\text{BV-BRST}})[ [ \hbar , g ] ]

be a Lagrangian density, to be thought of as an interaction, so that for $g_{sw} \in C^\infty_{sp}(\Sigma)\langle g \rangle$ an adiabatic switching the transgression

S_{int,sw} \;\coloneqq\; \tau_\Sigma(g_{sw} \mathbf{L}_{int}) \;\in\; LocObs(E_{\text{BV-BRST}})[ [ \hbar, g, j] ]

is a local observable, to be thought of as an adiabatically switched interaction action functional.

For $\mathcal{O} \subset \Sigma$ a causally closed subset of spacetime (def. ) and for $g_{sw} \in Cutoffs(\mathcal{O})$ an adiabatic switching function (def. ) which is constant on a neighbourhood of $\mathcal{O}$ , write

Gen(E_{\text{BV-BRST}}, S_{int,sw} )(\mathcal{O}) \;\coloneqq\; \left\langle \mathcal{Z}_{S_{int,sw}}(j A) \;\vert\; A \in LocObs(E_{\text{BV-BRST}})[ [ \hbar, g] ] \,\text{with}\, supp(A) \subset \mathcal{O} \right\rangle \;\subset\; PolyObs(E_{\text{BV-BRST}})[ [ \hbar, g, j] ]

for the smallest subalgebra of the Wick algebra which contains the generating functions (def. ) with respect to $S_{int,sw}$ for all those local observables $A$ whose spacetime support is in $\mathcal{O}$ .

Moreover, write

Gen(E_{\text{BV-BRST}}, \mathbf{L}_{int})(\mathcal{O}) \;\subset\; \underset{g_{sw} \in Cutoffs(\mathcal{O})}{\prod} Gen(E_{\text{BV-BRST}}, S_{int,sw})(\mathcal{O})

be the subalgebra of the Cartesian product of all these algebras as $g_{sw}$ ranges over cutoffs, which is generated by the tuples

\mathcal{Z}_{\mathbf{L}_{int}}(A) \;\coloneqq\; \left( \mathcal{Z}_{S_{int,sw}}(j A) \right)_{g_{sw} \in Cutoffs(\mathcal{O})}

for $A$ with $supp(A) \subset \mathcal{O}$ .

We call $Gen(E_{\text{BV-BRST}}, \mathbf{L}_{int} )(\mathcal{O})$ the algebra of generating functions for interacting field observables localized in $\mathcal{O}$ .

Finally, for $\mathcal{O}_1 \subset \mathcal{O}_2$ an inclusion of two causally closed subsets, let

i_{\mathcal{O}_1, \mathcal{O}_2} \;\colon\; Gen(E_{\text{BV-BRST}}, \mathbf{L}_{int})(\mathcal{O}_1) \longrightarrow Gen(E_{\text{BV-BRST}}, \mathbf{L}_{int})(\mathcal{O}_2)

be the algebra homomorphism which is given simply by restricting the index set of tuples.

This construction defines a functor

Gen(E_{\text{BV-BRST}}, \mathbf{L}_{int}) \;\colon\; CausClsdSubsets(\Sigma) \longrightarrow Algebras

from the poset of causally closed subsets of spacetime to the category of algebras.

(extends to star algebras if scattering matrices are chosen unitary…)

(Brunetti-Fredenhagen 99, (65)-(67))

The key technical fact is the following:

Proposition

(localized interacting field observables independent of adiabatic switching)

Let $(E_{\text{BV-BRST}}, \mathbf{L}', \Delta_H )$ be a relativistic free vacuum according to def. , let $\mathcal{S}$ be a corresponding S-matrix scheme according to def. , and let

\mathbf{L}_{int} \;\in\; \Omega^{p+1,0}_\Sigma(E_{\text{BV-BRST}})[ [ \hbar , g ] ]

be a Lagrangian density, to be thought of as an interaction, so that for $g_{sw} \in C^\infty_{sp}(\Sigma)\langle g \rangle$ an adiabatic switching the transgression

g S_{int,sw} \;\coloneqq\; \tau_\Sigma(g_{sw} \mathbf{L}_{int}) \;\in\; LocObs(E_{\text{BV-BRST}})[ [ \hbar, g, j] ]

is a local observable, to be thought of as an adiabatically switched interaction action functional.

If two such adiabatic switchings $g_{sw,1}, g_{sw,2} \in C^\infty_{cp}(\Sigma)$ agree on a causally closed subset

\mathcal{O} \;\subset\; \Sigma

in that

g_{sw,1}\vert_{\mathcal{O}} = g_{sw,2}\vert_{\mathcal{O}}

then there exists a microcausal polynomial observable

K \;\in\; PolyObs(E_{\text{BV-BRST}})_{mc}[ [ \hbar, g, j ] ]

such that for every local observable

A \;\in\; LocObs(E_{\text{BV-BRST}})[ [ \hbar, g ] ]

with spacetime support in $\mathcal{O}$

supp(A) \;\subset\; \mathcal{O}

the corresponding two generating functions (232) are related via conjugation by $K$ :

(236)

\mathcal{Z}_{S_{int,sw_2}} \left( j A \right) \;=\; K^{-1} \, \left( \mathcal{Z}_{S_{int,sw_1}} \left( j A \right) \right) \, K \,.

In particular this means that for every choice of adiabatic switching $g_{sw} \in Cutoffs(\mathcal{O})$ the algebra $Gen_{S_{int,sw}}(\mathcal{O})$ of generating functions for interacting field observables computed with $g_{sw}$ is canonically isomorphic to the abstract algebra $Gen_{\mathbf{L}_{int}}(\mathcal{O})$ (def. ), by the evident map on generators:

(237)

\array{ Gen(E_{\text{BV-BRST}}, \mathbf{L}_{int})(\mathcal{o}) &\overset{\simeq}{\longrightarrow}& Gen(E_{\text{BV-BRST}}, S_{int,sw})(\mathcal{O}) \\ \left( \mathcal{Z}_{S_{int,sw'}} \right)_{g_{sw'} \in Cutoffs(\mathcal{O})} &\mapsto& \mathcal{Z}_{S_{int,sw}} } \,.

(Brunetti-Fredenhagen 99, prop. 8.1)

Proof

By causal closure of $\mathcal{O}$ , lemma says that there are bump functions

a, r \in C^\infty_{cp}(\Sigma)\langle g \rangle

which decompose the difference of adiabatic switchings

g_{sw,2} - g_{sw,1} = a + r

subject to the causal ordering

supp(a) \,{\vee\!\!\!\wedge}\, \mathcal{O} \,{\vee\!\!\!\wedge}\, supp(r) \,.

With this the result follows from repeated use of causal additivity in its various equivalent incarnations from prop. :

\begin{aligned} & \mathcal{Z}_{g S_{int,sw_2}}(j A) \\ & = \mathcal{Z}_{ \left( \tau_\Sigma \left( g_{sw,2} \mathbf{L}_{int} \right) \right) } \left( j A \right) \\ & = \mathcal{Z}_{ \left( \tau_\Sigma \left( (g_{sw,1} + a + r)\mathbf{L}_{int} \right) \right) } \left( j A \right) \\ & = \mathcal{Z}_{ \left( g S_{int,sw_1} + \tau_\Sigma \left( r \mathbf{L}_{int} \right) + \tau_\Sigma \left( a \mathbf{L}_{int} \right) \right) } \left( j A \right) \\ & = \mathcal{Z}_{ \left( g S_{int,sw_1} + \tau_\Sigma \left( r \mathbf{L}_{int} \right) \right) } \left( j A \right) \\ & = \mathcal{S} \left( g S_{int,sw_1} + \tau_\Sigma \left( r \mathbf{L}_{int} \right) \right)^{-1} \, \mathcal{S} \left( g S_{int,sw_1} + j A + \tau_\Sigma \left( r \mathbf{L}_{int} \right) \right) \\ & = \mathcal{S} \left( g S_{int,sw_1} + \tau_\Sigma \left( r \mathbf{L}_{int} \right) \right)^{-1} \, \mathcal{S} \left( g S_{int,sw_1} + j A \right) \, \mathcal{S} \left( g S_{int,sw_1} \right)^{-1} \, \mathcal{S} \left( j A + \tau_\Sigma \left( r \mathbf{L}_{int} \right) \right) \\ & = \mathcal{S} \left( g S_{int,sw_1} + \tau_\Sigma \left( r\mathbf{L}_{int} \right) \right)^{-1} \, \underset{ = id }{ \underbrace{ \mathcal{S} \left( g S_{int,sw_1} \right) \, \mathcal{S} \left( g S_{int,sw_1} \right)^{-1} } } \, \mathcal{S} \left( g S_{int,sw_1} + j A \right) \, \mathcal{S} \left( g S_{int , sw_1} \right)^{-1} \, \mathcal{S} \left( j A + \tau_\Sigma \left( r \mathbf{L}_{int} \right) \right) \\ & = \underset{ K^{-1} }{ \underbrace{ \left( \mathcal{Z}_{ g S_{int,sw_1} } \left( \tau_\Sigma \left( r \mathbf{L}_{int} \right) \right) \right)^{-1} } } \, \mathcal{Z}_{ g S_{int,sw_1} } \left( j A \right) \,\, \underset{ K }{ \underbrace{ \mathcal{Z}_{ g S_{int,sw_1} } \left( \tau_\Sigma \left( r \mathbf{L}_{int} \right) \right) }} \end{aligned}

This proves the existence of elements $K$ as claimed.

It is clear that conjugation induces an algebra homomorphism, and since the map is a linear isomorphism on the space of generators, it is an algebra isomorphism on the algebras being generated (237).

(While the elements $K$ in (236) are far from being unique themselves, equation (236) says that the map on generators induced by conjugation with $K$ is independent of this choice.)

Proposition

(system of generating algebras is causally local net)

Let $(E_{\text{BV-BRST}}, \mathbf{L}', \Delta_H )$ be a relativistic free vacuum according to def. , let $\mathcal{S}$ be a corresponding S-matrix scheme according to def. , and let

\mathbf{L}_{int} \;\in\; \Omega^{p+1,0}_\Sigma(E_{\text{BV-BRST}})[ [ \hbar , g ] ]

be a Lagrangian density, to be thought of as an interaction.

Then the system

Gen(E_{\text{BV-BRST}}, \mathbf{L}_{int}) \;\colon\; CausCldSubsets(\Sigma) \longrightarrow Algebra

of localized generating functions for interacting field observables (def. ) is a causally local net in that it satisfies the following conditions:

(isotony) For every inclusion $\mathcal{O}_1 \subset \mathcal{O}_2$ of causally closed subsets of spacetime the corresponding algebra homomorphism is a monomorphism

$i_{\mathcal{O}_1, \mathcal{O}_2} \;\colon\; Gen(E_{\text{BV-BRST}}, \mathbf{L}_{int})(\mathcal{O}_1) \hookrightarrow Gen(E_{\text{BV-BRST}},\mathbf{L}_{int})(\mathcal{O}_2)$
(causal locality) For $\mathcal{O}_1, \mathcal{O}_2 \subset X$ two causally closed subsets which are spacelike separated, in that their causal ordering (def. ) satisfies

$\mathcal{O}_1 {\vee\!\!\!\wedge} \mathcal{O}_2 \;\text{and}\; \mathcal{O}_2 {\vee\!\!\!\wedge} \mathcal{O}_1$

and for $\mathcal{O} \subset \Sigma$ any further causally closed subset which contains both

$\mathcal{O}_1 , \mathcal{O}_2 \subset \mathcal{O}$

then the corresponding images of the generating function algebras of interacting field observables localized in $\mathcal{O}_1$ and in $\mathcal{O}_2$ , respectively, commute with each other as subalgebras of the generating function algebras of interacting field observables localized in $\mathcal{O}$ :

$\left[ i_{\mathcal{O}_1,\mathcal{O}}(Gen_{L_{int}}(\mathcal{O}_1)) \;,\; i_{\mathcal{O}_2,\mathcal{O}}(Gen_{L_{int}}(\mathcal{O}_2)) \right] \;=\; 0 \;\;\; \in Gen(E_{\text{BV-BRST}}, \mathbf{L}_{int})(\mathcal{O}) \,.$

(Dütsch-Fredenhagen 00, section 3, following Brunetti-Fredenhagen 99, section 8, Il’in-Slavnov 78)

Proof

Isotony is immediate from the definition of the algebra homomorphisms in def. .

By the isomorphism (237) we may check causal localizy with respect to any choice of adiabatic switching $g_{sw} \in Cautoff(\mathcal{O})$ constant over $\mathcal{O}$ . For this the statement follows, with the assumption of spacelike separation, by causal additivity (prop. ):

For $supp(A_1) \subset \mathcal{O}_1$ and $supp(A_2) \subset \mathcal{O}_2$ we have:

\begin{aligned} \mathcal{Z}_{g S_{int,sw}}( j A_1 ) \mathcal{Z}_{g S_{int,sw}}( j A_2 ) & = \mathcal{S}_{g S_{int,sw}}( j A_1 + j A_2) \\ & = \mathcal{S}_{g S_{int,sw}}( j A_2 + j A_1) \\ & = \mathcal{Z}_{g S_{int,sw}}( j A_2 ) \mathcal{Z}_{g S_{int,sw}}( j A_1 ) \end{aligned}

With the causally local net of localized generating functions for interacting field observables in hand, it is now immediate to get the

Definition

(system of interacting field algebras of observables)

Let $(E_{\text{BV-BRST}}, \mathbf{L}', \Delta_H )$ be a relativistic free vacuum according to def. , let $\mathcal{S}$ be a corresponding S-matrix scheme according to def. , and let

\mathbf{L}_{int} \;\in\; \Omega^{p+1,0}_\Sigma(E_{\text{BV-BRST}})[ [ \hbar , g ] ]

be a Lagrangian density, to be thought of as an interaction, so that for $g_{sw} \in C^\infty_{sp}(\Sigma)\langle g \rangle$ an adiabatic switching the transgression

g S_{int,sw} \;\coloneqq\; g \tau_\Sigma(g_{sw} \mathbf{L}_{int}) \;\in\; LocObs(E_{\text{BV-BRST}})[ [ \hbar, g ] ]\langle g \rangle

is a local observable, to be thought of as an adiabatically switched interaction action functional.

For $\mathcal{O} \subset \Sigma$ a causally closed subset of spacetime (def. ) and for $g_{sw} \in Cutoffs(\mathcal{O})$ an compatible adiabatic switching function (def. ) write

IntObs(E_{\text{BV-BRST}}, S_{int,sw})(\mathcal{O}) \coloneqq \left\langle i \hbar \frac{d}{d j} \mathcal{Z}_{S_{int}}(j A)\vert_{j = 0} \;\vert\; supp(A) \subset \mathcal{O} \right\rangle \;\subset\; PolyObs((\hbar))[ [ g ] ]

for the interacting field algebra of observables (def. ) with spacetime support in $\mathcal{O}$ .

Let then

IntObs(E_{\text{BV-BRST}}, \mathbf{L}_{int})(\mathcal{O}) \subset \underset{g_{sw} \in Cutoffs(\mathcal{O})}{\prod} IntObs(E_{\text{BV-BRST}}, S_{int,sw})(\mathcal{O})

be the subalgebra of the Cartesian product of all these algebras as $g_{sw}$ ranges, which is generated by the tuples

i \hbar \frac{d}{d j } \mathcal{Z}_{\mathbf{L}_{int}}\vert_{j = 0} \;\coloneqq\; \left( i \hbar \frac{d}{d j } \mathcal{Z}_{S_{int,sw}} (j A)\vert_{j = 0} \right)_{g_{sw} \in Cutoffs(\mathcal{O})}

for $supp(A) \subset \mathcal{O}$ .

Finally, for $\mathcal{O}_1 \subset \mathcal{O}_2$ an inclusion of two causally closed subsets, let

i_{\mathcal{O}_1, \mathcal{O}_2} \;\colon\; IntObs(E_{\text{BV-BRST}}, \mathbf{L}_{int})(\mathcal{O}_1) \longrightarrow IntObs(E_{\text{BV-BRST}}, \mathbf{L}_{int})(\mathcal{O}_2)

be the algebra homomorphism which is given simply by restricting the index set of tuples.

This construction defines a functor

IntObs(E_{\text{BV-BRST}}, \mathbf{L}_{int}) \;\colon\; CausClsdSubsets(\Sigma) \longrightarrow Algebras

from the poset of causally closed subsets in the spacetime $\Sigma$ to the category of star algebras.

Finally, as a direct corollary of prop. , we obtain the key result:

Proposition

(system of interacting field algebras of observables is causally local)

Let $(E_{\text{BV-BRST}}, \mathbf{L}', \Delta_H )$ be a relativistic free vacuum according to def. , let $\mathcal{S}$ be a corresponding S-matrix scheme according to def. , and let

\mathbf{L}_{int} \;\in\; \Omega^{p+1,0}_\Sigma(E_{\text{BV-BRST}})[ [ \hbar , g ] ] \,.

be a Lagrangian density, to be thought of as an interaction, then the system of algebras of observables $Obs_{L_{int}}$ (def. ) is a local net of observables in that

(isotony) For every inclusion $\mathcal{O}_1 \subset \mathcal{O}_2$ of causally closed subsets the corresponding algebra homomorphism is a monomorphism

$i_{\mathcal{O}_1, \mathcal{O}_2} \;\colon\; IntObs(E_{\text{BV-BRST}}, \mathbf{L}_{int})(\mathcal{O}_1) \hookrightarrow IntObs(E_{\text{BV-BRST}}, \mathbf{L}_{int})(\mathcal{O}_2)$
(causal locality) For $\mathcal{O}_1, \mathcal{O}_2 \subset X$ two causally closed subsets which are spacelike separated, in that their causal ordering (def. ) satisfies

$\mathcal{O}_1 {\vee\!\!\!\wedge} \mathcal{O}_2 \;\text{and}\; \mathcal{O}_2 {\vee\!\!\!\wedge} \mathcal{O}_1$

and for $\mathcal{O} \subset \Sigma$ any further causally closed subset which contains both

$\mathcal{O}_1 , \mathcal{O}_2 \subset \mathcal{O}$

then the corresponding images of the generating algebras of $\mathcal{O}_1$ and $\mathcal{O}_2$ , respectively, commute with each other as subalgebras of the generating algebra of $\mathcal{O}$ :

$\left[ i_{\mathcal{O}_1,\mathcal{O}}(Obs_{\mathbf{L}_{int}}(\mathcal{O}_1)) \;,\; i_{\mathcal{O}_2,\mathcal{O}}(Obs_{\mathbf{L}_{int}}(\mathcal{O}_2)) \right] \;=\; 0 \;\;\; \in IntObs(E_{\text{BV-BRST}}, \mathbf{L}_{int})(\mathcal{O}) \,.$

(Dütsch-Fredenhagen 00, below (17), following Brunetti-Fredenhagen 99, section 8, Il’in-Slavnov 78)

Proof

The first point is again immediate from the definition (def. ).

For the second point it is sufficient to check the commutativity relation on generators. For these the statement follows with prop. :

\begin{aligned} & \left[ i \hbar \frac{d}{d j} \mathcal{Z}_{S_{int,sw}}(j A_1)\vert_{j = 0} \;,\; i \hbar \frac{d}{d j} \mathcal{Z}_{S_{int,sw}}(j J_2)\vert_{j = 0} \right] \\ & = (i \hbar)^2 \frac{ \partial^2 }{ \partial j_1 \partial j_2 } \underset{ = 0}{ \underbrace{ \left[ \mathcal{Z}_{S_{int,sw}}(j_1 A_1) \;,\; \mathcal{Z}_{S_{int,sw}}(j_1 A_2) \right]}}_{ \left\vert { {j_1 = 0} \atop {j_2 = 0} } \right. } \\ & = 0 \end{aligned}

$\,$

time-ordered products

Definition suggests to focus on the multilinear operations $T(...)$ which define the perturbative S-matrix order-by-order in $\hbar$ . We impose axioms on these time-ordered products directly (def. ) and then prove that these axioms imply the axioms for the corresponding S-matrix (prop. below).

Definition

(time-ordered products)

Let $(E_{\text{BV-BRST}}, \mathbf{L}', \Delta_H )$ be a free vacuum according to def. .

A time-ordered product is a sequence of multi-linear continuous functionals for all $k \in \mathbb{N}$ of the form

T_k \;\colon\; \left( {\, \atop \,} LocObs(E_{\text{BV-BRST}})[ [\hbar, g, j] ]\langle g,j \rangle {\, \atop \,} \right)^{\otimes^k_{\mathbb{C}[ [\hbar, g, j] ]}} \longrightarrow PolyObs(E_{\text{BV-BRST}})_{reg}((\hbar))[ [g , j] ]

(from tensor products of local observables to microcausal polynomial observables, with formal parameters adjoined according to def. ) such that the following conditions hold for all possible arguments:

(normalization)

$T_0(O) = 1$
(perturbation)

$T_1(O) = :O:$
(symmetry) each $T_k$ is symmetric in its arguments, in that for every permutation $\sigma \in \Sigma(k)$ of $k$ elements

$T_k(O_{\sigma(1)}, O_{\sigma(2)}, \cdots, O_{\sigma(k)}) \;=\; T_k(O_1, O_2, \cdots, O_k)$
(causal factorization) If the spacetime support (def. ) of local observables satisfies the causal ordering (def. )

$\left( {\, \atop \,} supp(O_1) \cup \cdots \cup supp(O_r) {\, \atop \,} \right) \;{\vee\!\!\!\wedge}\; \left( {\, \atop \,} supp(O_{r+1}) \cup \cdots \cup supp(O_k) {\, \atop \,} \right)$

then the time-ordered product of these $k$ arguments factors as the Wick algebra-product of the time-ordered product of the first $r$ and that of the second $k-r$ arguments:

$T(O_1, \cdots, O_k) \; = \; T( O_1, \cdots , O_r ) \, T( O_{r+1}, \cdots , O_k ) \,.$

Example

(S-matrix scheme implies time-ordered products)

Let $(E_{\text{BV-BRST}}, \mathbf{L}', \Delta_H )$ be a relativistic free vacuum according to def. and let

\mathcal{S} \;=\; \underset{k \in \mathbb{N}}{\sum} \frac{1}{k!}\frac{1}{(i \hbar)^k} T_k

be a corresponding S-matrix scheme according to def. .

Then the $\{T_k\}_{k \in \mathbb{N}}$ are time-ordered products in the sense of def. .

Proof

We need to show that the $\{T_k\}_{k \in \mathbb{N}}$ satisfy causal factorization.

For

O_j\;\in\; LocObs(E_{\text{BV-BRST}})[ [ \hbar, g, j] ]\langle g,j\rangle

a local observable, consider the continuous linear function that muliplies this by any real number

\array{ \mathbb{R} &\longrightarrow& LocObs(E_{\text{BV-BRST}})[ [ \hbar, g, j] ]\langle g,j\rangle \\ \kappa_j &\mapsto& \kappa_j O_j } \,.

Since the $T_k$ by definition are continuous linear functionals, they are in particular differentiable maps, and hence so is the S-matrix $\mathcal{S}$ . We may extract $T_k$ from $\mathcal{S}$ by differentiation with respect to the parameters $\kappa_j$ at $\kappa_j = 0$ :

T_k(O_1, \cdots, O_k) \;=\; \frac{\partial^k}{ \partial \kappa_1 \cdots \partial \kappa_k } \mathcal{S}\left( \kappa_1 O_1 + \cdots + \kappa_k O_k \right)\vert_{\kappa_1, \cdots, \kappa_k = 0}

for all $k \in \mathbb{N}$ .

Now the causal additivity of the S-matrix $\mathcal{S}$ implies its causal factorization (remark ) and this implies the causal factorization of the $\{T_k\}$ by the product law of differentiation:

\begin{aligned} T_k(O_1, \cdots, O_k) & = (i \hbar)^k \frac{\partial^k}{ \partial \kappa_1 \cdots \partial \kappa_k } \mathcal{S}\left( \kappa_1 O_1 + \cdots + \kappa_k O_k \right)\vert_{\kappa_1, \cdots, \kappa_k = 0} \\ & = (i \hbar)^k \frac{\partial^k}{ \partial \kappa_1 \cdots \partial \kappa_k } \left( {\, \atop \,} \mathcal{S}(\kappa_1 O_1 + \cdots + \kappa_r O_r) \, \mathcal{S}(\kappa_{r+1} O_{r+1} + \cdots + \kappa_k O_k) {\, \atop \,} \right) \vert_{\kappa_1, \cdots, \kappa_k = 0} \\ & = (i \hbar)^r \frac{\partial^r}{ \partial \kappa_1 \cdots \partial \kappa_r } \mathcal{S}(\kappa_1 O_1 + \cdots + \kappa_r O_r) \vert_{\kappa_1, \cdots, \kappa_r = 0} \; (i \hbar)^{k-r} \frac{\partial^{k-r}}{ \partial \kappa_{r+1} \cdots \partial \kappa_k } \mathcal{S}(\kappa_{r+1} O_{r+1} + \cdots + \kappa_k O_k) \vert_{\kappa_{r+1}, \cdots, \kappa_k = 0} \\ & = T_{r}( O_1, \cdots, O_{r} ) \, T_{k-r}( O_{r+1}, \cdots, O_{k} ) \end{aligned} \,.

The converse implication, that time-ordered products induce an S-matrix scheme involves more work (prop. below).

Remark

(time-ordered products as generalized functions)

It is convenient (as in Epstein-Glaser 73) to think of time-ordered products (def. ), being Wick algebra-valued distributions (hence operator-valued distributions if we were to choose a representation of the Wick algebra by linear operators on a Hilbert space), as generalized functions depending on spacetime points:

\left\{ \alpha_i \in \Omega^{p+1,0}_\Sigma(E_{\text{BV-BRST}})\langle g \rangle \right\} \cup \left\{ \beta_j \in \Omega^{p+1,0}_\Sigma(E_{\text{BV-BRST}})\langle j \rangle \right\}

is a finite set of horizontal differential forms, and

\left\{ g_i, j_{j} \in C^\infty_{cp}(\Sigma) \right\}

is a corresponding set of bump functions on spacetime (adiabatic switchings), so that

\left\{ S_j \colon \Phi \mapsto \underset{\Sigma}{\int} g_i(x) \, \left(j^\infty_\Sigma(\Phi)^\ast \alpha_i\right)(x)\, dvol_\Sigma(x) \right\} \;\cup\; \left\{ A_j \colon \Phi \mapsto \underset{\Sigma}{\int} j_i(x) \, \left(j^\infty_\Sigma(\Phi)^\ast \beta_i\right)(x)\, dvol_\Sigma(x) \right\}

is the corresponding set of local observables, then we may write the time-ordered product of these observables as the integration of these bump functions against a generalized function $T_{(\alpha_i)}$ with values in the Wick algebra:

\begin{aligned} & \underset{\Sigma^n}{\int} T_{(\alpha_i), (\beta_j)}(x_1, \cdots, x_{r}, x_{r+1}, \cdots x_{n}) g_1(x_1) \cdots g_r(x_r) \, j_1(x_{r+1}) \cdots j_n(x_n) \, dvol_{\Sigma^n}(x_1, \cdots x_n) \\ & \coloneqq T( S_1, \cdots, S_r, A_{r+1}, \cdots, A_n ) \end{aligned} \,.

Moreover, the subscripts on these generalized functions will always be clear from the context, so that in computations we may notationally suppress these.

Finally, due to the “symmetry” axiom in def. , a time-ordered product depends, up to signs, only on its set of arguments, not on the order of the arguments. We will write $\mathbf{X} \coloneqq \{x_1, \cdots, x_r\}$ and $\mathbf{Y} \coloneqq \{y_1, \cdots y_r\}$ for sets of spacetime points, and hence abbreviate the expression for the “value” of the generalized function in the above as $T(\mathbf{X}, \mathbf{Y})$ etc.

In this condensed notation the above reads

\underset{\Sigma^{r+s}}{\int} T(\mathbf{X}, \mathbf{Y}) \, g_1(x_1) \cdots g_r(x_r) j_{r+1}(x_{r+1}) \cdots j_n(x_n) \, dvol_{\Sigma^{r+s}}(\mathbf{X}) \,.

This condensed notation turns out to be greatly simplify computations, as it absorbs all the “relative” combinatorial prefactors:

Example

(product of perturbation series in generalized function-notation)

Let

U(g) \coloneqq \underoverset{n = 0}{\infty}{\sum} \frac{1}{n!} \int U(x_1, \cdots, x_n) \, g(x_1) \cdots g(x_n) \, dvol

and

V(g) \coloneqq \underoverset{n = 0}{\infty}{\sum} \frac{1}{n!} \int V(x_1, \cdots, x_n) \, g(x_1) \cdots g(x_n) \, dvol

be power series of Wick algebra-valued distributions in the generalized function-notation of remark .

Then their product $W(g) \coloneqq U(g) V(g)$ with generalized function-representation

W(g) \coloneqq \underoverset{n = 0}{\infty}{\sum} \frac{1}{n!} \int W(x_1, \cdots, x_n) \, g(x_1) \cdots g(x_n) \, dvol

is given simply by

W(\mathbf{X}) \;=\; \underset{\mathbf{I} \subset \mathbf{X}}{\sum} U(\mathbf{I}) V(\mathbf{X} \setminus \mathbf{I}) \,.

(Epstein-Glaser 73 (5))

Proof

For fixed cardinality ${\vert \mathbf{I} \vert} = n_1$ the sum over all subsets $\mathbf{I} \subset \mathbf{X}$ overcounts the sum over partitions of the coordinates as $(x_1, \cdots x_{n_1}, x_{n_1 + 1}, \cdots x_n)$ precisely by the binomial coefficient $\frac{n!}{n_1! (n - n_1) !}$ . Here the factor of $n!$ cancels against the “global” combinatorial prefactor in the above expansion of $W(g)$ , while the remaining factor $\frac{1}{n_1! (n - n_1) !}$ is just the “relative” combinatorial prefactor seen at total order $n$ when expanding the product $U(g)V(g)$ .

In order to prove that the axioms for time-ordered products do imply those for a perturbative S-matrix (prop. below) we need to consider the corresponding reverse-time ordered products:

Definition

(reverse-time ordered products)

Given a time-ordered product $T = \{T_k\}_{k \in \mathbb{N}}$ (def. ), its reverse-time ordered product

\overline{T}_k \;\colon\; \left( {\, \atop \,} LocObs(E_{\text{BV-BRST}})[ [\hbar, g, j] ] {\, \atop \,} \right) \longrightarrow PolyObs(E_{\text{BV-BRST}})((\hbar))[ [g, j] ]

for $k \in \mathbb{N}$ is defined by

\overline{T}( A_1 \cdots A_n ) \;\coloneqq\; \left\{ \array{ \underoverset{r = 1}{n}{\sum} (-1)^r \underset{\sigma \in Unshuffl(n,r)}{\sum} T( A_{\sigma(1)} \cdots A_{\sigma(k_1)} ) \, T( A_{\sigma(k_1 + 1)} \cdots A_{\sigma(k_2)} ) \cdots T( A_{\sigma(k_{r-1}+1)} \cdots A_{\sigma_{k_r}} ) &\vert& k \geq 1 \\ 1 &\vert& k = 0 } \right. \,,

where the sum is over all unshuffles $\sigma$ of $(1 \leq \cdots \leq n)$ into $r$ non-empty ordered subsequences. Alternatively, in the generalized function-notation of remark , this reads

\overline{T}( \mathbf{X} ) = \underoverset{r = 1}{{\vert \mathbf{X} \vert}}{\sum} (-1)^r \underset{ \array{ \mathbf{I}_1, \cdots, \mathbf{I}_r \neq \emptyset \\ \underset{j \neq k}{\forall}\left( \mathbf{I}_j \cap \mathbf{I}_k = \emptyset \right) \\ \mathbf{I}_1 \cup \cdots \cup \mathbf{I}_r = \mathbf{X} } }{\sum} T( \mathbf{I}_1 ) \cdots T(\mathbf{I}_r)

(Epstein-Glaser 73, (11))

Proposition

(reverse-time ordered products express inverse S-matrix)

Given time-ordered products $T(-)$ (def. ), then the corresponding reverse time-ordered product $\overline{T}(-)$ (def. ) expresses the inverse $S(-)^{-1}$ (according to remark ) of the corresponding perturbative S-matrix scheme $\mathcal{S}(S_{int}) \coloneqq \underset{k \in \mathbb{N}}{\sum} \tfrac{1}{k!} T(\underset{k\,\text{args}}{\underbrace{S_{int}, \cdots , S_{int}}})$ (def. ):

\left( {\, \atop \,} \mathcal{S}(g S_{int} + j A ) {\, \atop \,} \right)^{-1} \;=\; \underset{k \in \mathbb{N}}{\sum} \frac{1}{k!} \left( \frac{1}{i \hbar} \right)^k \overline{T}( \underset{k \, \text{arguments}}{\underbrace{ (g S_{int} + j A), \cdots, (g S_{int} + j A)}} ) \,.

Proof

For brevity we write just “ $A$ ” for $\tfrac{1}{i \hbar}(g S_{int} + j A)$ . (Hence we assume without restriction that $A$ is not independent of powers of $g$ and $j$ ; this is just for making all sums in the following be order-wise finite sums.)

By definition we have

\begin{aligned} & \underset{k \in \mathbb{N}}{\sum} \frac{1}{k!} \overline{T}( \underset{k \, \text{args}}{\underbrace{A, \cdots , A}} ) \\ & = \underset{ k \in \mathbb{N}}{\sum} \frac{1}{k!} \underoverset{r = 1}{k}{\sum} (-1)^r \!\!\!\underset{\sigma \in Unshuffl(k,r)}{\sum}\!\!\! T( A_{\sigma(1)} \cdots A_{\sigma(k_1)} ) T( A_{\sigma(k_1 + 1)} \cdots A_{\sigma(k_2)} ) \cdots T( A_{\sigma(k_{r-1}+1)} \cdots A_{\sigma_{k_r}} ) \end{aligned}

where all the $A_k$ happen to coincide: $A_k = A$ .

If instead of unshuffles (i.e. partitions into non-empty subsequences preserving the original order) we took partitions into arbitrarily ordered subsequences, we would be overcounting by the factorial of the length of the subsequences, and hence the above may be equivalently written as:

\cdots = \underset{k \in \mathbb{N}}{\sum} \tfrac{1}{k!} \underoverset{r = 1}{k}{\sum} (-1)^r \!\!\! \underset{ {\sigma \in \Sigma(k)} \atop { { k_1 + \cdots + k_r = k } \atop { \underset{i}{\forall} (k_i \geq 1) } } }{\sum} \!\!\! \tfrac{1}{k_1!} \cdots \tfrac{1}{k_r !} \, T( A_{\sigma(1)} \cdots A_{\sigma(k_1)} ) \, T( A_{\sigma(k_1 + 1)} \cdots A_{\sigma(k_2)} ) \cdots T( A_{\sigma(k_{r-1}+1)} \cdots A_{\sigma_{k_r}} ) \,,

where $\Sigma(k)$ denotes the symmetric group (the set of all permutations of $k$ elements).

Moreover, since all the $A_k$ are equal, the sum is in fact independent of $\sigma$ , it only depends on the length of the subsequences. Since there are $k!$ permutations of $k$ elements the above reduces to

\begin{aligned} \cdots & = \underset{k \in \mathbb{N}}{\sum} \underoverset{r = 1}{k}{\sum} (-1)^r \!\!\! \underset{ k_1 + \cdots + k_r = k }{\sum} \tfrac{1}{k_1!} \cdots \tfrac{1}{k_r !} T( \underset{k_1 \, \text{factors}}{\underbrace{ A, \cdots , A }} ) T( \underset{k_2 \, \text{factors}}{\underbrace{ A, \cdots , A }} ) \cdots T( \underset{k_r \, \text{factors}}{\underbrace{ A, \cdots , A }} ) \\ & = \underoverset{r = 0}{\infty}{\sum} \left( - \underoverset{k = 0}{\infty}{\sum} T ( \underset{k\,\text{factors}}{\underbrace{A, \cdots , A}} ) \right)^r \\ & = \mathcal{S}(A)^{-1} \,, \end{aligned}

where in the last line we used (231).

In fact prop. is a special case of the following more general statement:

Proposition

(inversion relation for reverse-time ordered products)

Let $\{T_k\}_{k \in \mathbb{N}}$ be time-ordered products according to def. . Then the reverse-time ordered products according to def. satisfies the following inversion relation for all $\mathbf{X} \neq \emptyset$ (in the condensed notation of remark ):

\underset{\mathbf{J} \subset \mathbf{X}}{\sum} T(\mathbf{J}) \overline{T}(\mathbf{X} \setminus \mathbf{J}) \;=\; 0

and

\underset{\mathbf{J} \subset \mathbf{X}}{\sum} \overline{T}(\mathbf{X} \setminus \mathbf{J}) T(\mathbf{J}) \;=\; 0

Proof

This is immediate from unwinding the definitions.

Proposition

(reverse causal factorization of reverse-time ordered products)

Let $\{T_k\}_{k \in \mathbb{N}}$ be time-ordered products according to def. . Then the reverse-time ordered products according to def. satisfies reverse-causal factorization.

(Epstein-Glaser 73, around (15))

Proof

In the condensed notation of remark , we need to show that for $\mathbf{X} = \mathbf{P} \cup \mathbf{Q}$ with $\mathbf{P} \cap \mathbf{Q} = \emptyset$ then

\left( \mathbf{P} {\vee\!\!\!\wedge} \mathbf{Q} \right) \;\Rightarrow\; \left( \overline{T}(\mathbf{X}) = \overline{T}(\mathbf{Q}) \overline{T}(\mathbf{P}) \right) \,.

We proceed by induction. If ${\vert \mathbf{X}\vert} = 1$ the statement is immediate. So assume that the statement is true for sets of cardinality $n \geq 1$ and consider $\mathbf{X}$ with ${\vert \mathbf{X}\vert} = n+1$ .

We make free use of the condensed notation as in example .

From the formal inversion

\underset{\mathbf{J} \subset \mathbf{X}}{\sum} \overline{T}(\mathbf{J}) T(\mathbf{X}\setminus \mathbf{J}) = 0

(which uses the induction assumption that ${\vert \mathbf{X}\vert} \geq 1$ ) it follows that

\begin{aligned} \overline{T}(\mathbf{X}) & = - \underset{ { \mathbf{J} \subset \mathbf{X} } \atop { \mathbf{J} \neq \mathbf{X} } }{\sum} \overline{T}(\mathbf{J}) T( \mathbf{X} \setminus \mathbf{J} ) \\ & = - \underset{ { \mathbf{J} \cup \mathbf{J}' = \mathbf{X} } \atop { { \mathbf{J} \cap \mathbf{J}' = \emptyset } \atop { \mathbf{J}' \neq \emptyset } } }{\sum} \overline{T}( \mathbf{Q} \cap \mathbf{J} ) \overline{T}( \mathbf{P} \cap \mathbf{J} ) T ( \mathbf{P} \cap ( \mathbf{J}' ) ) T ( \mathbf{Q} \cap ( \mathbf{J}' ) ) \\ & = - \underset{ { \mathbf{L} \cup \mathbf{L}' = \mathbf{Q} \,,\, \mathbf{L} \cap \mathbf{L}' = \emptyset } \atop { \mathbf{L}' \neq \emptyset } }{\sum} \!\!\! \overline{T}( \mathbf{L} ) \underset{ = 0}{ \underbrace{ \left( \underset{ \mathbf{K} \subset \mathbf{P} }{\sum} \overline{T}( \mathbf{K} ) T( \mathbf{P} \setminus \mathbf{K}) \right) } } T(\mathbf{L'}) - \overline{T}(\mathbf{Q}) \underset{ = - \overline{T}(\mathbf{P}) }{ \underbrace{ \underset{ {\mathbf{K} \subset \mathbf{P}} \atop { \mathbf{K} \neq \emptyset } }{\sum} \overline{T}(\mathbf{K}) T (\mathbf{P} \setminus \mathbf{K} ) }} \\ & = \overline{T}(\mathbf{Q}) \overline{T}(\mathbf{P}) \end{aligned} \,.

Here

in the second line we used that $\mathbf{X} = \mathbf{Q} \sqcup \mathbf{P}$ , together with the

causal factorization property of $T(-)$ (which holds by def. ) and that of $\overline{T}(-)$

(which holds by the induction assumption, using that $\mathbf{J} \neq \mathbf{X}$ hence that ${\vert \mathbf{J}\vert} \lt {\vert \mathbf{X}\vert}$ ).

in the third line we decomposed the sum over $\mathbf{J}, \mathbf{J}' \subset \mathbf{X}$ into two sums over subsets of $\mathbf{Q}$ and $\mathbf{P}$ :
1. The first summand in the third line is the contribution where $\mathbf{J}'$ has a non-empty intersection with $\mathbf{Q}$ . This makes $\mathbf{K}$ range without constraint, and therefore the sum in the middle vanishes, as indicated, as it is the contribution at order ${\vert \mathbf{Q}\vert}$ of the inversion formula from prop. .
2. The second summand in the third line is the contribution where $\mathbf{J}'$ does not intersect $\mathbf{Q}$ . Now the sum over $\mathbf{K}$ is the inversion formula from prop. except for one term, and so it equals that term.

Using these facts about the reverse-time ordered products, we may finally prove that time-ordered products indeed do induced a perturbative S-matrix:

Proposition

(time-ordered products induce S-matrix)

Let $\{T_k\}_{k \in \mathbb{N}}$ be a system of time-ordered products according to def. . Then

\begin{aligned} \mathcal{S}(-) & \coloneqq T \left( \exp_\otimes \left( \tfrac{1}{i \hbar}(-) \right) \right) \\ & \coloneqq \underset{k \in \mathbb{N}}{\sum} \tfrac{1}{k!} \tfrac{1}{(i \hbar)^k} T( \underset{k \, \text{factors}}{\underbrace{-, \cdots , -}} ) \end{aligned}

is indeed a perturbative S-matrix according to def. .

Proof

The axiom “perturbation” of the S-matrix is immediate from the axioms “perturbation” and “normalization” of the time-ordered products. What requires proof is that causal additivity of the S-matrix follows from the causal factorization property of the time-ordered products.

Notice that also the weaker causal factorization property of the S-matrix (remark ) is immediate from the causal factorization condition on the time-ordered products.

But causal additivity is stronger. It is remarkable that this, too, follows from just the time-ordering (Epstein-Glaser 73, around (73)):

To see this, first expand the generating function $\mathcal{Z}$ (232) into powers of $g$ and $j$

\mathcal{Z}_{g S_{int}}(j A) \;=\; \underoverset{n,m = 0}{\infty}{\sum} \frac{1}{n! m!} R\left( {\, \atop \,} \underset{n\, \text{factors}}{\underbrace{g S_{int}, \cdots ,g S_{int}}}, ( \underset{m \, \text{factors}}{ \underbrace{ j A , \cdots , j A } } ) {\, \atop \,} \right)

and then compare order-by-order with the given time-ordered product $T$ and its induced reverse-time ordered product (def. ) via prop. . (These $R(-,-)$ are also called the “generating retarded products, discussed in their own right around def. below.)

In the condensed notation of remark and its way of absorbing combinatorial prefactors as in example this yields at order $(g/\hbar)^{\vert \mathbf{Y}\vert} (j/\hbar)^{\vert \mathbf{X}\vert}$ the coefficient

(238)

R(\mathbf{Y}, \mathbf{X}) \;=\; \underset{\mathbf{I} \subset \mathbf{Y}}{\sum} \overline{T}(\mathbf{I}) T( (\mathbf{Y} \setminus \mathbf{I}) , \mathbf{X} ) \,.

We claim now that the support of $R$ is inside the subset for which $\mathbf{Y}$ is in the causal past of $\mathbf{X}$ . This will imply the claim, because by multi-linearity of $R(-,-)$ it then follows that

\left(supp(A_1) {\vee\!\!\!\wedge} supp(A_2)\right) \Rightarrow \left( Z_{(g S_{int} + j A_1)}(j A_2) = Z_{S_{int}}(A_2) \right)

and by prop. this is equivalent to causal additivity of the S-matrix.

It remains to prove the claim:

Consider $\mathbf{X}, \mathbf{Y} \subset \Sigma$ such that the subset $\mathbf{P} \subset \mathbf{Y}$ of points not in the past of $\mathbf{X}$ , hence the maximal subset with causal ordering

\mathbf{P} {\vee\!\!\!\wedge} \mathbf{X} \,,

is non-empty. We need to show that in this case $R(\mathbf{Y}, \mathbf{X}) = 0$ (in the sense of generalized functions).

Write $\mathbf{Q} \coloneqq \mathbf{Y} \setminus \mathbf{P}$ for the complementary set of points, so that all points of $\mathbf{Q}$ are in the past of $\mathbf{X}$ . Notice that this implies that $\mathbf{P}$ is also not in the past of $\mathbf{Q}$ :

\mathbf{P} {\vee\!\!\!\wedge} \mathbf{Q} \,.

With this decomposition of $\mathbf{Y}$ , the sum in (238) over subsets $\mathbf{I}$ of $\mathbf{Y}$ may be decomposed into a sum over subsets $\mathbf{J}$ of $\mathbf{P}$ and $\mathbf{K}$ of $\mathbf{Q}$ , respectively. These subsets inherit the above causal ordering, so that by the causal factorization property of $T(-)$ (def. ) and $\overline{T}(-)$ (prop. ) the time-ordered and reverse time-ordered products factor on these arguments:

\begin{aligned} R(\mathbf{Y}, \mathbf{X}) & = \underset{ {\mathbf{J} \subset \mathbf{P}} \atop { \mathbf{K} \subset \mathbf{Q} } }{\sum} \, \overline{T}( \mathbf{J} \cup \mathbf{K} ) T( (\mathbf{P} \setminus \mathbf{J}) \cup (\mathbf{Q} \setminus \mathbf{K}), \mathbf{X} ) \\ & = \underset{ {\mathbf{J} \subset \mathbf{P}} \atop { \mathbf{K} \subset \mathbf{Q} } }{\sum} \, \overline{T}( \mathbf{K} ) \overline{T}( \mathbf{J} ) T( \mathbf{P} \setminus \mathbf{J} ) T( \mathbf{Q} \setminus \mathbf{K}, \mathbf{X} ) \\ & = \underset{ \mathbf{K} \subset \mathbf{Q} }{\sum} \overline{T}(\mathbf{K}) \underset{= 0}{ \underbrace{ \left( \underset{\mathbf{J} \subset \mathbf{P}}{\sum} \overline{T}(\mathbf{J}) T( \mathbf{P} \setminus \mathbf{J} ) \right) }} T(\mathbf{Q} \setminus \mathbf{K}, \mathbf{X}) \end{aligned} \,.

Here the sub-sum in brackets vanishes by the inversion formula, prop. .

In conclusion:

Proposition

(S-matrix scheme via causal factorization)

Let $(E_{\text{BV-BRST}}, \mathbf{L}', \Delta_H )$ be a relativistic free vacuum according to def. and consider a function

\mathcal{S} \;\colon\; LocObs(E_{\text{BV-BRST}})[ [ \hbar, g, j] ]\langle g,j \rangle \longrightarrow PolyObs(E_{\text{BV-BRST}})_{mc}((\hbar))[ [ g, j] ]

from local observables to microcausal polynomial observables which satisfies the condition “perturbation” from def. . Then the following two conditions on $\mathcal{S}$ are equivalent

causal additivity (def. )
causal factorization (remark )

and hence either of them is necessary and sufficient for $\mathcal{S}$ to be a perturbative S-matrix scheme according to def. .

Proof

That causal factorization follows from causal additivity is immediate (remark ).

Conversely, causal factorization of $\mathcal{S}$ implies that its expansion coefficients $\{T_k\}_{k \in \mathbb{N}}$ are time-ordered products (def. ), via the proof of example , and this implies causal additivity by prop. .

$\,$

(“Re”-)Normalization

We discuss now that time-ordered products as in def. , hence, by prop. , perturbative S-matrix schemes (def. ) exist in fact uniquely away from coinciding interaction points (prop. below).

This means that the construction of full time-ordered products/S-matrix schemes may be phrased as an extension of distributions of time-ordered products to the diagonal locus of coinciding spacetime arguments (prop. below). This choice in their definition is called the choice of ("re"-)normalization of the time-ordered products (remark ), and hence of the interacting pQFT that these define (def. below).

The space of these choices may be accurately characterized, it is a torsor over a group of re-definitions of the interaction-terms, called the “Stückelberg-Petermann renormalization group”. This is called the main theorem of perturbative renormalization, theorem below.

Here we discuss just enough of the ingredients needed to state this theorem. We give the proof in the next chapter.

$\,$

Definition

(tuples of local observables with pairwise disjoint spacetime support)

Let $(E_{\text{BV-BRST}}, \mathbf{L}', \Delta_H )$ be a relativistic free vacuum according to def. .

For $k \in \mathbb{N}$ , write

\left( {\, \atop \,} LocPoly(E_{\text{BV-BRST}})[ [ \hbar, g, j ] ] {\, \atop \,} \right)^{\otimes^k_{\mathbb{C}[ [\hbar, g, j ] ]}}_{pds} \hookrightarrow \left( {\, \atop \,} LocPoly(E_{\text{BV-BRST}})[ [ \hbar, g, j ] ] {\, \atop \,} \right)^{\otimes^k_{\mathbb{C}[ [\hbar, g, j ] ]}}

for the linear subspace of the $k$ -fold tensor product of local observables (as in def. , def. ) on those tensor products $A_1 \otimes \cdots A_k$ of tuples with disjoint spacetime support:

supp(A_j) \cap supp(A_k) = \emptyset \phantom{AAA} \text{for} \, i \neq j \in \{1, \cdots, k\} \,.

Proposition

(time-ordered product unique away from coinciding spacetime arguments)

Let $(E_{\text{BV-BRST}}, \mathbf{L}', \Delta_H )$ be a relativistic free vacuum according to def. , and let $T = \{T_k\}_{k \in \mathbb{N}}$ be a sequence of time-ordered products (def. )

\array{ \left( {\, \atop \,} LocObs(E_{\text{BV-BRST}})[ [\hbar, g, j] ] {\, \atop \,} \right)^{\otimes^k_{\mathbb{C}[ [\hbar, g, j] ]}} & \longrightarrow & PolyObs(E_{\text{BV-BRST}})_{reg}((\hbar))[ [g , j] ] \\ \uparrow & \nearrow_{(-) \star_F \cdots \star_F (-)} \\ \left( {\, \atop \,} LocObs(E_{\text{BV-BRST}})[ [\hbar, g, j] ] {\, \atop \,} \right)^{\otimes^k_{\mathbb{C}[ [\hbar, g, j] ]}}_{pds} }

Then their restriction to the subspace of tuples of local observables of pairwise disjoint spacetime support (def. ) is unique (independent of the "re-"normalization freedom in choosing $T$ ) and is given by the star product

A_1 \star_{F} A_2 \;\coloneqq\; ((-)\cdot (-)) \circ \exp\left( \hbar \left( \underset{\Sigma \times \Sigma}{\int} \Delta_F^{a b}(x,y) \frac{\delta}{\delta \mathbf{\Phi}^a(x)} \otimes \frac{\delta}{\delta \mathbf{\Phi}^b(y)} \, dvol_\Sigma(x)\, dvol_\Sigma(y) \right) \right) (A_1 \otimes A_2)

that is induced (def. ) by the Feynman propagator $\Delta_F \coloneqq \tfrac{i}{2}(\Delta_+ + \Delta_- + H)$ (corresponding to the Wightman propagator $\Delta_H = \tfrac{i}{2}(\Delta_+ - \Delta_-) + H$ which is given by the choice of free vacuum), in that

T \left( {\, \atop \,} A_1 , \cdots, A_k {\, \atop \,} \right) \;=\; A_1 \star_F \cdots \star_F A_k \,.

In particular the time-ordered product extends from the restricted domain of tensor products of local observables to a restricted domain of microcausal polynomial observables, where it becomes an associative product:

(239)

\begin{aligned} T(A_1, \cdots, A_{k_n}) & = T(A_1, \cdots, A_{k_1}) \star_F T(A_{k_1 + 1}, \cdots, A_{k_2}) \star_F \cdots \star_F T(A_{k_{n-1} + 1}, \cdots, A_{k_n}) \\ & = A_1 \star_F \cdots \star_F A_{k_n} \end{aligned}

for all tuples of local observables $A_1, \cdots, A_{k_1}, A_{k_1+1}, \cdots, A_{k_2}, \cdots, \cdots A_{k_n}$ with pairwise disjoint spacetime support.

The idea of this statement goes back at least to Epstein-Glaser 73, as in remark . One formulation appears as (Brunetti-Fredenhagen 00, theorem 4.3). The above formulation in terms of the star product is stated in (Fredenhagen-Rejzner 12, p. 27, Dütsch 18, lemma 3.63 (b)).

Proof

By induction over the number of arguments, it is sufficient to see that, more generally, for $A_1, A_2 \in PolyObs(E_{\text{BV-BRST}})_{mc}[ [ \hbar, g, j] ]$ two microcausal polynomial observables with disjoint spacetime support the star product $A_1 \star_F A_2$ is well-defined and satisfies causal factorization.

Consider two partitions of unity

(\chi_{1,i} \in C^\infty_{cp}(\Sigma))_{i} \phantom{AAA} (\chi_{1,j} \in C^\infty_{cp}(\Sigma))_{j}

and write $(A_{1,i})_i$ and $(A_{2,j})_{j}$ for the collection of microcausal polynomial observables obtained by multiplying all the distributional coefficients of $A_1$ and of $A_2$ with $\chi_{1,i}$ and with $\chi_{2,j}$ , respectively, for all $i$ and $j$ , hence such that

A_1 \;=\; \underset{i}{\sum} A_{1,i} \phantom{AAA} A_2 \;=\; \underset{j}{\sum} A_{1,j} \,.

By linearity, it is sufficient to prove that $A_{1,i} \star_F A_{2,j}$ is well defined for all $i,j$ and satisfies causal factorization.

Since the spacetime supports of $A_1$ and $A_2$ are assumed to be disjoint

supp(A_1) \cap supp(A_2) \;=\; \emptyset

we may find partitions such that each resulting pair of smaller supports is in fact in causal order-relation:

\array{ \left( supp(A_1) \cap supp(\chi_{1,i}) \right) {\vee\!\!\!\wedge} \left( supp(A_2) \cap supp(\chi_{2,j}) \right) \\ \text{or} \\ \left( supp(A_2) \cap supp(\chi_{2,j}) \right) {\vee\!\!\!\wedge} \left( supp(A_1) \cap supp(\chi_{1,u}) \right) } \phantom{AAAAA} \text{for all}\,\, i,j \,.

But now it follows as in the proof of prop. ) via (?) that

A_{1,i} \star_F A_{2,j} \;=\; \left\{ \array{ A_{1,i} \star_H A_{2,j} &\vert& supp(A_{1,i}) {\vee\!\!\!\wedge} supp(A_{2,j}) \\ A_{2,j} \star_H A_{1,i} &\vert& supp(A_{2,j}) {\vee\!\!\!\wedge} supp(A_{1,i}) } \right.

Finally the associativity-statement follows as in prop. .

Before using the unqueness of the time-ordered products away from coinciding spacetime arguments (prop. ) to characterize the freedom in ("re"-)normalizing time-ordered products, we pause to observe that in the same vein the time-ordered products have a unique extension of their domain also to regular polynomial observables. This is in itself a trivial statement (since all star products are defined on regular polynomial observables, def. ) but for understanding the behaviour under ("re"-)normalization of other structures, such as the interacting BV-differential (def. below) it is useful to understand renormalization as a process that starts extending awa from regular polynomial observables.

By prop. , on regular polynomial observables the S-matrix is given as follows:

Definition

(perturbative S-matrix on regular polynomial observables)

Let $(E_{\text{BV-BRST}}, \mathbf{L}', \Delta_H )$ be a relativistic free vacuum according to def. .

Recall that the time-ordered product on regular polynomial observables is the star product $\star_F$ induced by the Feynman propagator (def. ) and that, due to the non-singular nature of regular polynomial observables, this is given by conjugation of the pointwise product (89) with $\mathcal{T}$ (?) as

T(A_1, A_2) \;=\; A_1 \star_F A_2 \;=\; \mathcal{T}( \mathcal{T}^{-1}(A_1) \cdot \mathcal{T}^{-1}(A_2))

(prop. ).

We say that the perturbative S-matrix scheme on regular polynomial observables is the exponential with respect to $\star_F$ :

\mathcal{S} \;\colon\; PolyObs(E_{\text{BV-BRST}})_{reg}[ [\hbar, g , j] ] \longrightarrow PolyObs(E_{\text{BV-BRST}})_{reg}((\hbar))[ [ g, j] ]

given by

\mathcal{S}(S_{int}) = \exp_{\star_F} \left( \tfrac{1}{i \hbar} S_{int}) \right) \coloneqq 1 + \tfrac{1}{\i \hbar} S_{int} + \tfrac{1}{2} \tfrac{1}{(i \hbar)^2} S_{int} \star_F S_{int} + \cdots \,.

We think of $S_{int}$ here as an adiabatically switched non-point-interaction action functional.

We write $\mathcal{S}(S_{int})^{-1}$ for the inverse with respect to the Wick product (which exists by remark )

\mathcal{S}(S_{int})^{-1} \star_H \mathcal{S}(S_{int}) = 1 \,.

Notice that this is in general different form the inverse with respect to the time-ordered product $\star_F$ , which is $\mathcal{S}(-S_{int})$ :

\mathcal{S}(-S_{int}) \star_F \mathcal{S}(S_{int}) = 1 \,.

Similarly, by def. , on regular polynomial observables the quantum Møller operator is given as follows:

Definition

(quantum Møller operator on regular polynomial observables)

Let $(E_{\text{BV-BRST}}, \mathbf{L}', \Delta_H )$ be a relativistic free vacuum according to def. . Given an adiabatically switched non-point-interaction action functional in the form of a regular polynomial observable of degree 0

S_{int} \;\in\; PolyObs(E_{\text{BV-BRST}})_{reg}[ [\hbar , g, j] ]

then the corresponding quantum Møller operator on regular polynomial observables

\mathcal{R}^{-1} \;\colon\; PolyObs(E_{\text{BV-BRST}})_{reg}[ [\hbar , g, j] ] \longrightarrow PolyObs(E_{\text{BV-BRST}})_{reg}[ [\hbar, g, j] ]

is given by the derivative of Bogoliubov's formula

\mathcal{R}^{-1} \;\coloneqq\; \mathcal{S}(S_{int})^{-1} \star_H (\mathcal{S}(S_{int}) \star_F (-)) \,,

where $\mathcal{S}(S_{int}) = \exp_{\mathcal{T}}\left( \tfrac{-1}{i \hbar} S_{int} \right)$ is the perturbative S-matrix from def. .

This indeed lands in formal power series in Planck's constant $\hbar$ (by remark ), instead of in more general Laurent series as the perturbative S-matrix does (def. ).

Hence the inverse map is

\mathcal{R} \;=\; \mathcal{S}(-S_{int}) \star_F ( \mathcal{S}(S_{int}) \star(-) ) \,.

(Bogoliubov-Shirkov 59; the above terminology follows Hawkins-Rejzner 16, below def. 5.1)

(Beware that compared to Fredenhagen, Rejzner et. al. we change notation conventions $\mathcal{R} \leftrightarrow \mathcal{R}^{-1}$ in order to bring out the analogy to (the conventions for the) time-ordered product $A_1 \star_F A_2 = \mathcal{T}(\mathcal{T}^{-1}(A_1) \cdot \mathcal{T}^{-1}(A_2))$ on regular polynomial observables.)

Still by def. , on regular polynomial observables the interacting field algebra of observables is given as follows:

Definition

(interacting field algebra structure on regular polynomial observables)

S_{int} \;\in\; PolyObs(E_{\text{BV-BRST}})_{reg}[ [\hbar, g, j] ] \,,

then the interacting field algebra structure on regular polynomial observables

PolyObs(E_{\text{BV-BRST}})_{reg}[ [\hbar, g, j] ] \otimes PolyObs(E_{\text{BV-BRST}})_{reg}[ [\hbar, g, h] ] \overset{ \star_{int} }{\longrightarrow} PolyObs(E_{\text{BV-BRST}})_{reg}[ [\hbar, g, j] ]

is the conjugation of the Wick algebra-structure by the quantum Møller operator (def. ):

A_1 \star_{int} A_2 \;\coloneqq\; \mathcal{R} \left( \mathcal{R}^{-1}(A_1) \star_H \mathcal{R}^{-1}(A_2) \right)

(e.g. Fredenhagen-Rejzner 11b, (19))

Notice the following dependencies of these defnitions, which we leave notationally implicit:

endomorphism of regular polynomial observables	meaning	depends on choice of
$\phantom{AA}\mathcal{T}$	time-ordering	free Lagrangian density and Wightman propagator
$\phantom{AA}\mathcal{S}$	S-matrix	free Lagrangian density and Wightman propagator
$\phantom{AA}\mathcal{R}$	quantum Møller operator	free Lagrangian density and Wightman propagator and interaction

$\,$

After having discussed the uniqueness of the time-ordered products away from coinciding spacetime arguments (prop. ) we now phrase and then discuss the freedom in defining these products at coinciding arguments, thus ("re"-)normalizing them.

Definition

(Epstein-Glaser ("re"-)normalization of perturbative QFT)

Let $(E_{\text{BV-BRST}}, \mathbf{L}', \Delta_H )$ be a relativistic free vacuum according to def. .

Prop. implies that the problem of constructing a sequence of time-ordered products (def. ), hence, by prop. , an S-matrix scheme (def. ) for perturbative quantum field theory around the given free field vacuum, is equivalently a problem of a sequence of compatible extensions of distributions of the star products $\underset{k \; \text{arguments}}{\underbrace{(-)\star_F \cdots \star_F (-)}}$ of the Feynman propagator on $k$ arguments from the complement of coinciding events inside the Cartesian products $\Sigma^k$ of spacetime $\Sigma$ , along the canonical inclusion

\Sigma^k \setminus \left\{ (x_i) \,\vert\, \underset{i \neq j}{\exists} (x_i = x_j) \right\} \overset{\phantom{AAA}}{\hookrightarrow} \Sigma^k \,.

Via the associativity (239) of the restricted time-ordered product thesese choices are naturally made by induction over $k$ , choosing the $(k+1)$ -ary time-ordered product $T_{k+1}$ as an extension of distributions of $T_k(\underset{k \, \text{args}}{\underbrace{-, \cdots, -}}) \star_F (-)$ .

This inductive choice of extension of distributions of the time-ordered product to coinciding interaction points deserves to be called a choice of normalization of the time-ordered product (e.g. Scharf 94, section 4.3), but for historical reasons (see remark and remark ) it is known as re-normalization. Specifically the inductive construction by extension to coinciding interaction points is known as Epstein-Glaser renormalization.

In (Epstein-Glaser 73) this is phrased in terms of splitting of distributions. In (Brunetti-Fredenhagen 00, sections 4 and 7) the perspective via extension of distributions is introduced, following (Stora 93). Review is in (Dütsch 18, section 3.3.2).

Proposition already shows that the freedom in choosing the ("re"-)normalization of time-ordered products is at most that of extending them to the “fat diagonal”, where at least one pair of interaction points coincides. The following proposition says that when making these choices inductively in the arity of the time-ordered products as in def. then the available choice of ("re"-)normalization) at each stage is in fact only that of extension to the actual diagonal, where all interaction points coincide:

Proposition

(("re"-)normalization is inductive extension of time-ordered products to diagonal)

Let $(E_{\text{BV-BRST}}, \mathbf{L}', \Delta_H )$ be a relativistic free vacuum according to def. .

Assume that for $n \in \mathbb{N}$ , time-ordered products $\{T_{k}\}_{k \leq n}$ of arity $k \leq n$ have been constructed in the sense of def. . Then the time-ordered product $T_{n+1}$ of arity $n+1$ is uniquely fixed on the complement

\Sigma^{n+1} \setminus diag(n) \;=\; \left\{ (x_i \in \Sigma)_{i = 1}^n \;\vert\; \underset{i,j}{\exists} (x_i \neq x_j) \right\}

of the image of the diagonal inclusion $\Sigma \overset{diag}{\longrightarrow} \Sigma^{n}$ (where we regarded $T_{n+1}$ as a generalized function on $\Sigma^{n+1}$ according to remark ).

This statement appears in (Popineau-Stora 82), with (unpublished) details in (Stora 93), following personal communication by Henri Epstein (according to Dütsch 18, footnote 57). Following this, statement and detailed proof appeared in (Brunetti-Fredenhagen 99).

Proof

We will construct an open cover of $\Sigma^{n+1} \setminus \Sigma$ by subsets $\mathcal{C}_I \subset \Sigma^{n+1}$ which are disjoint unions of non-empty sets that are in causal order, so that by causal factorization the time-ordered products $T_{n+1}$ on these subsets are uniquely given by $T_{k}(-) \star_H T_{n-k}(-)$ . Then we show that these unique products on these special subsets do coincide on intersections. This yields the claim by a partition of unity.

We now say this in detail:

For $I \subset \{1, \cdots, n+1\}$ write $\overline{I} \coloneqq \{1, \cdots, n+1\} \setminus I$ . For $I, \overline{I} \neq \emptyset$ , define the subset

\mathcal{C}_I \;\coloneqq\; \left\{ (x_i)_{i \in \{1, \cdots, n+1\}} \in \Sigma^{n+1} \;\vert\; \{x_i\}_{i \in I} {\vee\!\!\!\wedge} \{x_j\}_{j \in \{1, \cdots, n+1\} \setminus I} \right\} \;\subset\; \Sigma^{n+1} \,.

Since the causal order-relation involves the closed future cones/closed past cones, respectively, it is clear that these are open subsets. Moreover it is immediate that they form an open cover of the complement of the diagonal:

\underset{ { I \subset \{1, \cdots, n+1\} \atop { I, \overline{I} \neq \emptyset } } }{\cup} \mathcal{C}_I \;=\; \Sigma^{n+1} \setminus diag(\Sigma) \,.

(Because any two distinct points in the globally hyperbolic spacetime $\Sigma$ may be causally separated by a Cauchy surface, and any such may be deformed a little such as not to intersect any of a given finite set of points. )

Hence the condition of causal factorization on $T_{n+1}$ implies that restricted to any $\mathcal{C}_{I}$ these have to be given (in the condensed generalized function-notation from remark on any unordered tuple $\mathbf{X} = \{x_1, \cdots, x_{n+1}\} \in \mathcal{C}_I$ with corresponding induced tuples $\mathbf{I} \coloneqq \{x_i\}_{i \in I}$ and $\overline{\mathbf{I}} \coloneqq \{x_i\}_{i \in \overline{I}}$ by

(240)

T_{n+1}( \mathbf{X} ) \;=\; T(\mathbf{I}) T(\overline{\mathbf{I}}) \phantom{AA} \text{for} \phantom{A} \mathcal{X} \in \mathcal{C}_I \,.

This shows that $T_{n+1}$ is unique on $\Sigma^{n+1} \setminus diag(\Sigma)$ if it exists at all, hence if these local identifications glue to a global definition of $T_{n+1}$ . To see that this is the case, we have to consider any two such subsets

I_1, I_2 \subset \{1, \cdots, n+1\} \,, \phantom{AA} I_1, I_2, \overline{I_1}, \overline{I_2} \neq \emptyset \,.

By definition this implies that for

\mathbf{X} \in \mathcal{C}_{I_1} \cap \mathcal{C}_{I_2}

a tuple of spacetime points which decomposes into causal order with respect to both these subsets, the corresponding mixed intersections of tuples are spacelike separated:

\mathbf{I}_1 \cap \overline{\mathbf{I}_2} \; {\gt\!\!\!\!\lt} \; \overline{\mathbf{I}_1} \cap \mathbf{I}_2 \,.

By the assumption that the $\{T_k\}_{k \neq n}$ satisfy causal factorization, this implies that the corresponding time-ordered products commute:

(241)

T(\mathbf{I}_1 \cap \overline{\mathbf{I}_2}) \, T(\overline{\mathbf{I}_1} \cap \mathbf{I}_2) \;=\; T(\overline{\mathbf{I}_1} \cap \mathbf{I}_2) \, T(\mathbf{I}_1 \cap \overline{\mathbf{I}_2}) \,.

Using this we find that the identifications of $T_{n+1}$ on $\mathcal{C}_{I_1}$ and on $\mathcal{C}_{I_2}$ , accrding to (260), agree on the intersection: in that for $\mathbf{X} \in \mathcal{C}_{I_1} \cap \mathcal{C}_{I_2}$ we have

\begin{aligned} T( \mathbf{I}_1 ) T( \overline{\mathbf{I}_1} ) & = T( \mathbf{I}_1 \cap \mathbf{I}_2 ) T( \mathbf{I}_1 \cap \overline{\mathbf{I}_2} ) \, T( \overline{\mathbf{I}_1} \cap \mathbf{I}_2 ) T( \overline{\mathbf{I}_1} \cap \overline{\mathbf{I}_2} ) \\ & = T( \mathbf{I}_1 \cap \mathbf{I}_2 ) \underbrace{ T( \overline{\mathbf{I}_1} \cap \mathbf{I}_2 ) T( \mathbf{I}_1 \cap \overline{\mathbf{I}_2} ) } T( \overline{\mathbf{I}_1} \cap \overline{\mathbf{I}_2} ) \\ & = T( \mathbf{I}_2 ) T( \overline{\mathbf{I}_2} ) \end{aligned}

Here in the first step we expanded out the two factors using (260) for $I_2$ , then under the brace we used (261) and in the last step we used again (260), but now for $I_1$ .

To conclude, let

\left( \chi_I \in C^\infty_{cp}(\Sigma^{n+1}) \right)_{ { I \subset \{1, \cdots, n+1\} } \atop { I, \overline{I} \neq \emptyset } }

be a partition of unity subordinate to the open cover formed by the $\mathcal{C}_I$ . Then the above implies that setting for any $\mathbf{X} \in \Sigma^{n+1} \setminus diag(\Sigma)$

T_{n+1}(\mathbf{X}) \;\coloneqq\; \underset{ { I \in \{1, \cdots, n+1\} } \atop { I, \overline{I} \neq \emptyset } }{\sum} \chi_i(\mathbf{X}) T( \mathbf{I} ) T( \overline{\mathbf{I}} )

is well defined and satisfies causal factorization.

Since ("re"-)normalization involves making choices, there is the freedom to impose further conditions that one may want to have satisfied. These are called renormalization conditions.

Definition

(renormalization conditions, protection from quantum corrections and quantum anomalies)

Let $(E_{\text{BV-BRST}}, \mathbf{L}', \Delta_H )$ be a relativistic free vacuum according to def. .

Then a condition $P$ on $k$ -ary functions of the form

T_k \;\colon\; \left( {\, \atop \,} LocObs(E_{\text{BV-BRST}})[ [\hbar, g, j] ] {\, \atop \,} \right)^{\otimes^k_{\mathbb{C}[ [\hbar, g, j] ]}} \longrightarrow PolyObs(E_{\text{BV-BRST}})_{reg}((\hbar))[ [g , j] ]

is called a renormalization condition if

it holds for the unique time-ordered products away from coinciding spacetime arguments (according to prop. );
whenever it holds for all unrestricted $T_{k \leq n}$ for some $n \in \mathbb{N}$ , then it also holds for $T_{n+1}$ restricted away from the diagonal:

$P(T_k)_{k \leq n} \;\Rightarrow\; P\left( T_{n+1}\vert_{\Sigma^{n+1} \setminus diag(\Sigma)} \right) \,.$

This means that a renormalization condition is a condition that may consistently be imposed degreewise in an inductive construction of time-ordered products by degreewise extension to the diagonal, according to prop. .

If specified renormalization conditions $\{P_i\}$ completely remove any freedom in the choice of time-ordered products for a given quantum observable, one says that the renormalization conditions protects the observable against quantum corrections.

If for specified renormalization conditions $\{P_i\}$ there is no choice of time-ordered products $\{T_k\}_{k \in \mathbb{N}}$ (def. ) that satisfies all these conditions, then one says that an interacting perturbative QFT satisfying $\{P_i\}$ fails to exist due to a quantum anomaly.

Proposition

(basic renormalization conditions)

Let $(E_{\text{BV-BRST}}, \mathbf{L}', \Delta_H )$ be a relativistic free vacuum according to def. .

Then the following conditions are renormalization conditions (def. ):

(field independence) The functional derivative of a polynomial observable arising as a time-ordered product takes contributions only from the arguments, not from the product operation itself; in generalized function-notation:

(242) $\frac{\delta}{\delta \mathbf{\Phi}^a(x)} T(A_1, \cdots, A_n) \;=\; \underset{1 \leq k \leq n}{\sum} T\left( A_1, \cdots, A_{k-1}, \frac{\delta}{\delta \mathbf{\Phi}^a(x)}A_k, A_{k+1}, \cdots, A_n \right)$
(translation equivariance) If the underlying spacetime is Minkowski spacetime, $\Sigma = \mathbb{R}^{p,1}$ , with the induced action of the translation group on polynomial observables

$\rho \;\colon\; \mathbb{R}^{p,1} \times PolyObs(E_{\text{BV-BRST}})[ [ \hbar, g, j ] ] \longrightarrow PolyObs(E_{\text{BV-BRST}})[ [ \hbar, g, j ] ]$

then

$\rho_v \left( {\, \atop \,} T(A_1, \cdots, A_n) {\, \atop \,}\right) \;=\; T(\rho_{v}(A_1), \cdots, \rho_v(A_n))$
(quantum master equation, master Ward identity) see prop.

(if this condition fails, the corresponding quantum anomaly (def. ) is called a gauge anomaly)

(Dütsch 18, p. 150 and section 4.2)

Proof

For the first two statements this is obvious from prop. and prop. , which imply that $T_{n+1}\vert_{\Sigma^{n+1} \setminus diag(\Sigma)}$ is uniquely specified from $\{T_k\}_{k \leq n}$ via the star product induced by the Feynman propagator, and the fact that, on Minkowski spacetime, this is manifestly translation invariant and independent of the fields (e.q. prop. ).

The third statement requires work. That the quantum master equation/(master Ward identity always holds on regular polynomial observables is prop. below. That it holds for $T_{n+1}\vert_{\Sigma^{n+1} \setminus diag(\Sigma)}$ if it holds for $\{T_k\}_{k \leq n}$ is shown in (Duetsch 18, section 4.2.2).

$\,$

We discuss methods for normalization (prop. ) and re-normalization in detail in the next chapter.

$\,$

Feynman perturbation series

By def and the main theorem of perturbative renormalization (theorem ), the construction of perturbative S-matrix schemes/time-ordered products may be phrased as ("re-")normalization of the star product induced by the Feynman propagator, namely as a choice of extension of distributions of the this star-product to the locus of coinciding interaction points.

Since the star product is the exponential of the binary contraction with the Feynman propagator, it is naturally expanded as a sum of products of distributions labeled by finite multigraphs (def. below), where each vertex corresponds to an interaction or source field insertion, and where each edge corresponds to one contractions of two of these with the Feynman propagator. The products of distributions arising this way are the Feynman amplitudes (prop. below).

If the free field vacuum is decomposed as a direct sum of distinct free field types/species (def. below), then in addition to the vertices also the edges in these graphs receive labels, now by the field species whose particular Feynman propagator is being used in the contraction at that edges. These labeled graphs are now called Feynman diagrams (def. below) and the products of distributions which they encode are their Feynman amplitudes built by the Feynman rules (prop. below).

The choice of ("re"-)normalization of the time-ordered products/S-matrix is thus equivalently a choice of ("re"-)normalization of the Feynman amplitudes for all possible Feynman diagrams. These are usefully organized in powers of $\hbar$ by their loop order (prop. below).

In conclusion, the Feynman rules make the perturbative S-matrix be equal to a formal power series of Feynman amplitudes labeled by Feynman graphs. As such it is known as the Feynman perturbation series (example below).

Notice how it is therefore the combinatorics of star products that governs both Wick's lemma in free field theory as well as Feynman diagrammatics in interacting field theory:

	free field algebra of quantum observables	physics terminology	maths terminology
1)	supercommutative product	$\phantom{AA} :A_1 A_2:$ normal ordered product	$\phantom{AA} A_1 \cdot A_2$ pointwise product of functionals
2)	non-commutative product (deformation induced by Poisson bracket)	$\phantom{AA} A_1 A_2$ operator product	$\phantom{AA} A_1 \star_H A_2$ star product for Wightman propagator
3)		$\phantom{AA} T(A_1 A_2)$ time-ordered product	$\phantom{AA} A_1 \star_F A_2$ star product for Feynman propagator
	perturbative expansion of 2) via 1)	Wick's lemma	Moyal product for Wightman propagator $\Delta_H$ $\begin{aligned} & A_1 \star_H A_2 = \\ & ((-)\cdot (-)) \circ \exp \left( \hbar \int (\Delta_H)^{ab}(x,y) \frac{\delta}{\delta \mathbf{\Phi}^a(x)} \otimes \frac{\delta}{\delta \mathbf{\Phi}^b(y)} \right)(A_1 \otimes A_2) \end{aligned}$
	perturbative expansion of 3) via 1)	Feynman diagrams	Moyal product for Feynman propagator $\Delta_F$ $\begin{aligned} & A_1 \star_F A_2 = \\ & ((-)\cdot (-)) \circ \exp \left( \hbar \int (\Delta_F)^{ab}(x,y) \frac{\delta}{\delta \mathbf{\Phi}^a(x)} \otimes \frac{\delta}{\delta \mathbf{\Phi}^b(y)} \right)(A_1 \otimes A_2) \end{aligned}$

$\,$

We now discuss Feynman diagrams and their Feynman amplitudes in two stages: First we consider plain finite multigraphs with linearly ordered vertices but no other labels (def. below) and discuss how these generally organize an expansion of the time-ordered products as a sum of distributional products of the given Feynman propagator (prop. below). These summands (or their vacuum expectation values) are called the Feynman amplitudes if one thinks of the underlying free field vacuum as having a single “field species” and of the chosen interaction to be a single “interaction vertex”.

But often it is possible and useful to identify different field species and different interaction vertices. In fact in applications this choice is typically evident and not highlighted as a choice. We make it explicit below as def. . Such a choice makes both the interaction term as well as the Feynman propagator decompose as sums (remark below). Accordingly then, after “multiplying out” the products of these sums that appear in the Feynman amplitudes, these, too, decompose further as as sums indexed by multigraphs whose edges are labeled by field species, and whose vertices are labeled by interactions. These labeled multigraphs are the Feynman diagrams (def. below) and the corresponding summands are the Feynman amplitudes proper (prop. below).

Definition

(finite multigraphs)

A finite multigraph is

a finite set $V$ (“of vertices”);
a finite set $E$ (“of edges”);
a function $E \overset{p}{\to} \left\{ {\,\atop \,} \{v_1, v_2\} = \{v_2, v_1\} \;\vert\; v_1, v_2 \in V \,,\; v_1 \neq v_2 {\, \atop \,} \right\}$

(sending any edge to the unordered pair of distinct vertices that it goes between).

A choice of linear order on the set of vertices of a finite multigraph is a choice of bijection of the form

V \simeq \{1, 2, \cdots, \nu\} \,.

Hence the isomorphism classes of a finite multigraphs with linearly ordered vertices are characterized by

a natural number

$\nu \coloneqq {\vert V\vert} \in \mathbb{N}$

(the number of vertices);
for each $i \lt j \in \{1, \cdots, \nu\}$ a natural number

$e_{i,j} \coloneqq {\vert p^{-1}(\{v_i,v_j\})\vert} \in \mathbb{N}$

(the number of edges between the $i$ th and the $j$ th vertex).

We write $\mathcal{G}_\nu$ for the set of such isomorphism classes of finite multigraphs with linearly ordered vertices identified with $\{1, 2, \cdots, \nu\}$ ; and we write

\mathcal{G} \;\coloneqq\; \underset{\nu \in \mathbb{N}}{\sqcup} \mathcal{G}_\nu

for the set of isomorphism classes of finite multigraphs with linearly ordered vertices of any number.

Proposition

(Feynman amplitudes of finite multigraphs)

Let $(E_{\text{BV-BRST}}, \mathbf{L}', \Delta_H )$ be a relativistic free vacuum according to def. .

For $\nu \in \mathbb{N}$ , the $\nu$ -fold time-ordered product away from coinciding interaction points, given by prop.

T_\nu \;\colon\; \left( {\, \atop \,} LocObs(E_{\text{BV-BRST}})[ [\hbar, g, j] ] {\, \atop \,} \right)^{\otimes^\nu_{\mathbb{C}[ [\hbar, g, j] ]}}_{pds} \longrightarrow PolyObs(E_{\text{BV-BRST}})_{reg}((\hbar))[ [g , j] ]

is equal to the following formal power series labeled by isomorphism classes of finite multigraphs with $\nu$ linearly ordered vertices, $\Gamma \in \mathcal{G}_\nu$ (def. ):

(243)

\begin{aligned} & T_\nu(O_1, \cdots , O_\nu) \\ & = \underset{\Gamma \in \mathcal{G}_\nu}{\sum} \Gamma\left(O_i)_{i = 1}^\nu\right) \\ & \coloneqq \underset{ \Gamma \in \mathcal{G}_\nu }{\sum} prod \circ \underset{ r \lt s \in \{1, \cdots, \nu\} }{\prod} \frac{\hbar^{e_{r,s}}}{e_{r,s}!} \left\langle (\Delta_{F})^{e_{r,s}} , \frac{\delta^{e_{r,s}}}{\delta \mathbf{\Phi}_r^{e_{r,s}}} \frac{\delta^{e_{r,s}}}{\delta \mathbf{\Phi}_s^{e_{r,s}}} \right\rangle \left( O_1 \otimes \cdots \otimes O_{\nu} \right) \\ & \coloneqq \underset{ \Gamma \in \mathcal{G}_\nu }{\sum} ((-) \cdot \cdots \cdot (-)) \circ \underset{ r \lt s \in \{1, \cdots,\nu\} }{\prod} \frac{\hbar^{e_{r,s}}}{e_{r,s}!} \\ & \phantom{AAA} \underset{i = 1, \cdots e_{r,s}}{\prod} \underset{\Sigma \times \Sigma}{\int} dvol_\Sigma(x_i) dvol_\Sigma(y_i) \, \Delta_F^{a_i b_i}(x_i,y_i) \\ & \phantom{AAAAAA} \left( O_1 \otimes \cdots \otimes O_{r-1} \otimes \frac{ \delta^{e_{r,s}} O_r }{ \delta \mathbf{\Phi}^{a_1}(x_1) \cdots \delta \mathbf{\Phi}^{a_{e_{r,s}}}(x_{e_{r,s}}) } \otimes O_{r+1} \otimes \cdots \otimes O_{s-1} \otimes \frac{ \delta^{e_{r,s}} O_s }{ \delta \mathbf{\Phi}^{b_1}(y_1) \cdots \delta \mathbf{\Phi}^{b_{e_{r,s}}}(y_{e_{r,s}}) } \otimes O_{s+1} \otimes \cdots \otimes O_\nu \right) \,, \end{aligned}

where $e_{r,s} \coloneqq e_{r,s}(\Gamma)$ is, for short, the number of edges between vertex $r$ and vertex $s$ in the finite multigraph $\Gamma$ of the outer sum, according to def. .

Here the summands of the expansion (243)

(244)

\Gamma\left( (O_i)_{i = 1}^\nu\right) \;\coloneqq\; prod \circ \underset{ r \lt s \in \{1, \cdots,\nu\} }{\prod} \frac{\hbar^{e_{r,s}}}{e_{r,s}!} \left\langle (\Delta_{F})^{e_{r,s}} , \frac{\delta^{e_{r,s}}}{\delta \mathbf{\Phi}_r^{e_{r,s}}} \frac{\delta^{e_{r,s}}}{\delta \mathbf{\Phi}_s^{e_{r,s}}} \right\rangle \left( O_1 \otimes \cdots \otimes O_{\nu} \right) \;\in\; PolyObs(E_{\text{BV-BRT}})((\hbar))[ [g,j ] ]

and/or their vacuum expectation values

\left\langle \Gamma\left((V_i)_{i = 1}^v\right) \right\rangle \;\in\; \mathbb{C}((\hbar))[ [ h, j] ]

are called the Feynman amplitudes for scattering processes in the given free field vacuum of shape $\Gamma$ with interaction vertices $O_i$ . Their expression as products of distributions via algebraic expression on the right hand side of (244) is also called the Feynman rules.

(Keller 10, IV.1)

Proof

We proceed by induction over the number $v$ of vertices. The statement is trivially true for a single vertex. So assume that it is true for $v \geq 1$ vertices. It follows that

\begin{aligned} & T(O_1, \cdots, O_\nu, O_{\nu+1}) \\ & = T( T(O_1, \cdots ,O_\nu), O_{\nu+1} ) \\ &= prod \circ \exp\left( \left\langle \hbar \Delta_F, \frac{\delta}{\delta \mathbf{\Phi}} \otimes \frac{\delta}{\delta \mathbf{\Phi}} \right\rangle \right) \left( \left( prod \circ \!\!\!\! \underset{\Gamma \in \mathcal{G}_\nu }{\sum} \underset{ { r \lt s } \atop { \in \{1, \cdots, \nu\} } }{\prod} \frac{1}{e_{r,s}!} \left\langle (\hbar \Delta_F)^{e_{r,s}} \,,\, \frac{\delta^{e_{r,s}}}{\delta \mathbf{\Phi}_r^{e_{r,s}}} \frac{\delta^{e_{r,s}}}{ \delta \mathbf{\Phi}_s^{e_{r,s}} } \right\rangle (O_1 \otimes \cdots \otimes O_\nu) \right) \,\otimes\, O_{\nu+1} \right) \\ & = prod \circ \underset{\Gamma \in \mathcal{G}_\nu }{\sum} \\ & \phantom{=} \underset{ { r \lt s } \atop { \in \{1,\cdots, \nu\}} }{\prod} \frac{1}{e_{r,s}!} \left\langle (\hbar \Delta_F)^{e_{r,s}} \,,\, \frac{\delta^{e_{r,s}}}{\delta \mathbf{\Phi}_r^{e_{r,s}}} \frac{\delta^{e_{r,s}}}{ \delta \mathbf{\Phi}_s^{e_{r,s}} } \right\rangle \\ & \phantom{=} \underset{ { e_{\nu+1} =} \atop { e_{1,{\nu+1}} + \cdots + e_{\nu,\nu + 1} } }{\sum} \underset{ = (e_{1,\nu + 1}) \cdots (e_{\nu,\nu+1})) }{ \underbrace{ \frac{ \left( { e_{\nu + 1} } \atop { (e_{1, \nu + 1}), \cdots, (e_{\nu , \nu+1}) } \right) }{ ( e_{\nu+1} ) ! } } } \left\langle (\hbar \Delta_F)^{e_{\nu+1}} \left( \frac{\delta^{e_{1,\nu+1}} O_1 }{\delta \mathbf{\Phi}^{e_{1,\nu+1}}} \otimes \cdots \otimes \frac{ \delta^{e_{\nu,\nu+1}} O_\nu }{ \delta \mathbf{\Phi}^{e_{\nu,\nu+1}} } \;\otimes\; \frac{ \delta^{ e_{\nu + 1} } O_{\nu+1} }{ \delta \mathbf{\Phi}^{e_{1,\nu+1} + \cdots + e_{\nu,\nu+1}} } \right\rangle \right) \\ &= prod \circ \underset{\Gamma \in \mathcal{G}_{\nu+1} }{\sum} \underset{ { r \lt s } \atop { \in \{1, \cdots, \nu+1\} } }{\prod} \tfrac{1}{e_{r,s}!} \left\langle (\hbar \Delta_F)^{e_{r,s}} \,,\, \frac{\delta^{e_{r,s}}}{\delta \mathbf{\Phi}_r^{e_{r,s}}} \frac{\delta^{e_{r,s}}}{\delta \mathbf{\Phi}_s^{e_{r,s}}} \right\rangle (O_1 \otimes \cdots \otimes O_{\nu+1}) \end{aligned}

The combinatorial factor over the brace is the multinomial coefficient expressing the number of ways of distributing $e_{\nu+1}$ -many functional derivatives to $v$ factors, via the product rule, and quotiented by the factorial that comes from the exponential in the definition of the star product.

Here in the first step we used the associativity (239) of the restricted time-ordered product, in the second step we used the induction assumption, in the third we passed the outer functional derivatives through the pointwise product using the product rule, and in the fourth step we recognized that this amounts to summing in addition over all possible choices of sets of edges from the first $v$ vertices to the new $\nu+1$ st vertex, which yield in total the sum over all diagrams with $\nu+1$ vertices.

If the free field theory is decomposed as a direct sum of free field theories (def. below), we obtain a more fine-grained concept of Feynman amplitudes, associated not just with a finite multigraph, but also with a labelling of this graph by field species and interaction types. These labeled multigraphs are the genuine Feynman diagrams (def. below):

Definition

(field species and interaction vertices)

Let $(E_{\text{BV-BRST}}, \mathbf{L}', \Delta_H )$ be a relativistic free vacuum according to def. , and let $g S_{int} + j A \;\in\; LocObs(E_{\text{BV-BRST}})[ [ \hbar , g , j] ]\langle g,j \rangle$ be a local observable regarded as an adiabatically switched interaction action functional.

Then

a choice of field species is a choice of decomposition of the BV-BRST field bundle $E_{\text{BV-BRST}}$ as a fiber product over finite set $Spec = \{sp_1, sp_2, \cdots, sp_n\}$ of (graded super-) field bundles

$E_{\text{BV-BRST}} \;\simeq\; E_{sp_1} \times_{\Sigma} \cdots \times_\Sigma E_{sp_n} \,,$

such that the gauge fixed free Lagrangian density $\mathbf{L}'$ is the sum

$\mathbf{L}' \;=\; \mathbf{L}'_{sp_1} + \cdots + \mathbf{L}'_{sp_n}$

of free Lagrangian densities

$\mathbf{L}'_{sp_i} \in \Omega^{p+1,0}_\Sigma(E_i)$

on these separate field bundles.

a choice of interaction vertices and external vertices is a choice of sum decomposition

$g S_{int} + j A \;=\; \underset{i \in Ext}{\sum} g S_{int,i} + \underset{j \in Int}{\sum} j A_j$

parameterized by finite sets $Int$ and $Ext$ , to be called the sets of internal vertex labels and external vertex labels, respectively.

Remark

(Feynman propagator for separate field species)

Let $(E_{\text{BV-BRST}}, \mathbf{L}', \Delta_H )$ be a relativistic free vacuum according to def. .

Then a choice of field species as in def. induces a corresponding decomposition of the Feynman propagator of the gauge fixed free field theory

\Delta_F \;\in\; \Gamma'_{\Sigma \times \Sigma}( E_{\text{BV-BRST}} \boxtimes E_{\text{BV-BRST}} )

as the sum of Feynman propagators for each of the chosen field species:

\Delta_F \;=\; \Delta_{F,1} + \cdots + \Delta_{F,n} \;\in\; \underoverset{i = 1}{n}{\oplus} \Gamma'_{\Sigma \times \Sigma}( E_{sp_i} \boxtimes E_{sp_i} ) \;\subset\; \Gamma'_{\Sigma \times \Sigma}( E_{\text{BV-BRST}} \boxtimes E_{\text{BV-BRST}} )

hence in components, with $(\phi^A$ the collective field coordinates on $E_{\text{BV-BRST}}$ , this decomposition is of the form

\left( \Delta_F^{A, B} \right) \;=\; \left( \array{ (\Delta_{F,1}^{a b}) & 0 & 0 & \cdots & 0 \\ 0 & (\Delta_{F,2}^{\alpha \beta}) & 0 & \cdots & 0 \\ \vdots & & & & \vdots \\ 0 & \cdots & \cdots & 0 & (\Delta_{F,n}^{i j}) } \right)

Example

(field species in quantum electrodynamics)

The field bundle for Lorenz gauge fixed quantum electrodynamics on Minkowski spacetime $\Sigma$ admits a decomposition into field species, according to def. , as

E_{\text{BV-BRST}} \;=\; \underset{ \text{Dirac} \atop \text{field} }{ \underbrace{ (S_{odd} \times \Sigma) }} \times_\Sigma \underset{ {\text{electromagnetic field &amp;}} \atop {\text{Nakanishi-Lautrup field}} }{ \underbrace{ T^\ast\Sigma \times_\Sigma (\mathbb{R} \times \Sigma) }} \times_\Sigma \underset{ \text{ghost field} }{ \underbrace{ (\mathbb{R}[1] \times \Sigma) } } \times_\Sigma \underset{ \text{antighost field} }{ \underbrace{ (\mathbb{R}[-1] \times \Sigma) } }

(by example ) and example )).

The corresponding sum decomposition of the Feynman propagator, according to remark , is

\Delta_F \;=\; \underset{ \text{Dirac} \atop \text{field} }{ \underbrace{ \Delta_F^{\text{electron}} } } + \underset{ \text{electromagnetic field &amp;} \atop \text{Nakanishi-Lautrup field} }{ \underbrace{ \left( \array{ \Delta_F^{photon} & * \\ * & * } \right) } } + \Delta_F^{ghost} + \Delta_F^{\text{antighost}} \,,

where

$\Delta_F^{\text{electron}}$ is the electron propagator (def. ));
$\Delta_F^{photon}$ is the photon propagator in Gaussian-averaged Lorenz gauge (prop. );
the ghost field and antighost field Feynman propagators $\Delta_F^{ghost}$ , and $\Delta_F^{antighost}$ are each one copy of the Feynman propagator of the real scalar field (prop. ), while the Nakanishi-Lautrup field contributes a mixing with the photon propagator, notationally suppressed behind the star-symbols above.

Definition

(Feynman diagrams)

Let moreover

E_{\text{BV-BRST}} \;\simeq\; \underset{sp \in Spec}{\times} E_{sp} \,,

be a choice of field species, according to def ,

g S_{int} + j A \;=\; \underset{i \in Ext}{\sum} g S_{int,i} + \underset{j \in Int}{\sum} j A_j

a choice of internal and external interaction vertices according to def. .

With these choices, we say that a Feynman diagram $(\Gamma, vertlab, edgelab)$ is

a finite multigraph with linearly ordered vertices (def. )

$\Gamma \in \mathcal{G} \,,$
a function from its vertices

$vertlab \;\colon\; V_{\Gamma} \longrightarrow Int \sqcup Ext$

to the disjoint union of the chosen sets of internal and external vertex labels;
a function from its edges

$edgelab \;\colon\; E_{\Gamma} \to Spec$

to the chosen set of field species.

We write

\array{ \mathcal{G}^{Feyn} &\overset{\text{forget} \atop \text{labels}}{\longrightarrow}& \mathcal{G} \\ (\Gamma,vertlab, edgelab) &\mapsto& \Gamma }

for the set of isomorphism classes of Feynman diagrams with labels in $Sp$ , refining the set of isomorphisms of plain finite multigraphs with linearly ordered vertices from def. .

Proposition

(Feynman amplitudes for Feynman diagrams)

Let moreover

E_{\text{BV-BRST}} \;\simeq\; \underset{sp \in Spec}{\times} E_{sp} \,,

be a choice of field species, according to def , hence inducing, by remark , a sum decomposition of the Feynman propagator

(245)

\Delta_F \;=\; \underset{sp \in Spec}{\sum}\Delta_{F,sp} \,,

and let

(246)

g S_{int} + j A \;=\; \underset{i \in Ext}{\sum} g S_{int,i} + \underset{j \in Int}{\Sum} j A_j

be a choice of internal and external interaction vertices according to def. .

Then by “multiplying out” the products of the sums (245) and (246) in the formula (244) for the Feynman amplitude $\Gamma\left( (g S_{int} + j A))_{i = 1}^\nu \right)$ (def. ) this decomposes as a sum of the form

\Gamma\left( (g S_{int} + j A)_{i = 1}^\nu \right) \;=\; \underset{ { V_\Gamma \overset{vertlab}{\longrightarrow} Int \sqcup Ext} \atop { E_\Gamma \overset{edgelab}{\longrightarrow} Spec } }{\sum} \left( \Gamma, edgelab, vertlab \right) (g S_{int} + j A)

over all ways of labeling the vertices $v$ of $\Gamma$ by the internal or external vertex labels, and the edges $e$ of $\Gamma$ by field species. The corresponding summands

\left( \Gamma, edgelab, vertlab \right) (g S_{int} + j A) \;\in\; PolyObs(E_{\text{BV-BRST}})[ [ \hbar, g, j] ]

or rather their vacuum expectation value

\left\langle \left( \Gamma, edgelab, vertlab \right) (g S_{int} + j A) \right\rangle \;\in\; \mathbb{C}[ [ \hbar, g, j ] ]

are called the Feynman amplitude associated with these Feynman diagrams.

Example

(Feynman amplitudes in causal perturbation theory – example of QED)

To recall, in perturbative quantum field theory, Feynman diagrams (def. ) are labeled finite multigraphs (def. ) that encode products of Feynman propagators, called Feynman amplitudes (prop. ) which in turn contribute to probability amplitudes for physical scattering processes – scattering amplitudes (example ):

The Feynman amplitudes are the summands in the Feynman perturbation series-expansion (example ) of the scattering matrix (def. )

\mathcal{S} \left( S_{int} \right) = \underset{k \in \mathbb{N}}{\sum} \frac{1}{k!} \frac{1}{(i \hbar)^k} T( \underset{k \, \text{factors}}{\underbrace{S_{int}, \cdots , S_{int}}} )

of a given interaction Lagrangian density $L_{int}$ (def. ).

The Feynman amplitudes are the summands in an expansion of the time-ordered products $T(\cdots)$ (def. ) of the interaction with itself, which, away from coincident vertices, is given by the star product of the Feynman propagator $\Delta_F$ (prop. ), via the exponential contraction

T(S_{int}, S_{int}) \;=\; prod \circ \exp \left( \hbar \int \Delta_{F}^{a b}(x,y) \frac{\delta}{\delta \mathbf{\Phi}^a(x)} \otimes \frac{\delta}{\delta \mathbf{\Phi}(y)} \right) ( S_{int} \otimes S_{int} ) \,.

Each edge in a Feynman diagram corresponds to a factor of a Feynman propagator in $T( \underset{k \, \text{factors}}{\underbrace{S_{int} \cdots S_{int}}} )$ , being a distribution of two variables; and each vertex corresponds to a factor of the interaction Lagrangian density at $x_i$ .

For example quantum electrodynamics (example ) in Gaussian-averaged Lorenz gauge (example ) involves (via example ):

the Dirac field modelling the electron, with Feynman propagator called the electron propagator (def. ), here to be denoted

$\Delta \phantom{AAAA} \text{electron propagator}$
the electromagnetic field modelling the photon, with Feynman propagator called the photon propagator (prop. ), here to be denoted

$G \phantom{AAAA} \text{photon propagator}$
the electron-photon interaction (48)

$L_{int} \;=\; \underset{ \text{interaction} }{ \underbrace{ i g (\gamma^\mu)^\alpha{}_\beta } } \, \underset{ { \text{incoming} \atop \text{electron} } \atop \text{field} }{\underbrace{\overline{\psi_\alpha}}} \; \underset{ { \, \atop \text{photon} } \atop \text{field} }{\underbrace{a_\mu}} \; \underset{ {\text{outgoing} \atop \text{electron} } \atop \text{field} }{\underbrace{\psi^\beta}}$

The Feynman diagram for the electron-photon interaction alone is

where the solid lines correspond to the electron, and the wiggly line to the photon. The corresponding product of distributions (prop. ) is (written in generalized function-notation, example )

\underset{ \text{loop order} }{ \underbrace{ \hbar^{3/2-1} } } \underset{ \text{electron-photon} \atop \text{interaction} }{ \underbrace{ i g (\gamma^\mu)^\alpha{}_\beta } } \,. \, \underset{ {\text{incoming} \atop \text{electron}} \atop \text{propagator} }{ \underbrace{ \overline{\Delta(-,x)}_{-, \alpha} } } \underset{ { \, \atop \text{photon} } \atop \text{propagator} }{ \underbrace{ G(x,-)_{\mu,-} } } \underset{ { \text{outgoing} \atop \text{electron} } \atop \text{propagator} }{ \underbrace{ \Delta(x,-)^{\beta, -} } }

Hence a typical Feynman diagram in the QED Feynman perturbation series induced by this electron-photon interaction looks as follows:

where on the bottom the corresponding Feynman amplitude product of distributions is shown; now notationally suppressing the contraction of the internal indices and all prefactors.

For instance the two solid edges between the vertices $x_2$ and $x_3$ correspond to the two factors of $\Delta(x_2,x_2)$ :

This way each sub-graph encodes its corresponding subset of factors in the Feynman amplitude:

graphics grabbed from Brouder 10

A priori this product of distributions is defined away from coincident vertices: $x_i \neq x_j$ (prop. below). The definition at coincident vertices $x_i = x_j$ requires a choice of extension of distributions (def. below) to the diagonal locus of coincident interaction points. This choice is the ("re-")normalization (def. below) of the Feynman amplitude.

Example

(Feynman perturbation series)

Let $(E_{\text{BV-BRST}}, \mathbf{L}', \Delta_H )$ be a relativistic free vacuum according to def. , and let

g S_{int} + j A \;\in\; LocObs(E_{\text{BV-BRST}})[ [ \hbar, g, h ] ]\langle g , j\rangle

be a local observable, regarded as a adiabatically switched interaction action functional.

By prop. every choice of perturbative S-matrix (def. )

\mathcal{S}(g S_{int} + j A) \;\in\; PolyObs(E_{\text{BV-BRST}})_{mc}((\hbar))[ [ g, j ] ] +

has an expansion as a formal power series of the form

\mathcal{S}(g S_{int} + j A) \;=\; \underset{\Gamma \in \mathcal{G}}{\sum} \Gamma\left( (g S_{int} + j A)_{i = 1}^{\nu(\Gamma)}\right) \,,

where the series is over all finite multigraphs with linearly ordered vertices $\Gamma$ (def. ), and the summands are the corresponding ("re"-)normalized (def. ) Feynman amplitudes (prop. ).

If moreover a choice of field species and of internal and external interaction vertices is made, according to def. , then this series expansion refines to an expansion over all Feynman diagrams $(\Gamma,edgelab, vertlab)$ (def. ) of Feynman amplitudes $(\Gamma, edgelab,vertlab)(g S_{int} + j A)$ (def. ):

\mathcal{S}(g S_{int} + j A) \;=\; \underset{(\Gamma,edgelab, vertlab) \in \mathcal{G}^{Feyn}}{\sum} (\Gamma, edgelab,vertlab)(g S_{int} + j A) \,,

Expressed in this form the S-matrix is known as the Feynman perturbation series.

Remark

(no tadpole Feynman diagrams)

In the definition of finite multigraphs in def. there are no edges considered that go from any vertex to itself. Accordingly, there are no such labeled edges in Feynman diagrams (def. ):

In pQFT these diagrams are called tadpoles, and their non-appearance is considered part of the Feynman rules (prop. ). Via prop. this condition reflects the nature of the star product (def. ) which always contracts different tensor product factors with the Feynman propagator before taking their pointwise product.

Beware that in graph theory these tadpoles are called “loops”, while here in pQFT a “loop” in a planar graph refers instead to what in graph theory is called a face of the graph, see the discussion of loop order in prop. below.

(Keller 10, remark II.8 and proof of prop. II.7)

$\,$

Effective action

We have seen that the Feynman perturbation series expresses the S-matrix as a formal power series of Feynman amplitudes labeled by Feynman diagrams. Now the Feynman amplitude associated with a disjoint union of connected Feynman diagrams (def. below) is just the product of the amplitudes of the connected components (prop. below). This allows to re-organize the Feynman perturbation series as the ordinary exponential of the Feynman perturbation series restricted to just connected Feynman diagrams. The latter is called the effective action (def. below) because it allows to express vacuum expectation values of the S-matrix as an ordinary exponential (equation (248) below).

Definition

(connected graphs)

Given two finite multigraphs $\Gamma_1, \Gamma_2 \in \mathcal{G}$ (def. ), their disjoint union

\Gamma_1 \sqcup \Gamma_2 \;\in\; \mathcal{G}

is the finite multigraph whose set of vertices and set of edges are the disjoint unions of the corresponding sets of $\Gamma_1$ and $\Gamma_2$

V_{\Gamma_1 \sqcup \Gamma_2} \;\coloneqq\; V_{\Gamma_1} \sqcup V_{\Gamma_2}

E_{\Gamma_1 \sqcup \Gamma_2} \;\coloneqq\; E_{\Gamma_1} \sqcup E_{\Gamma_2}

and whose vertex-assigning function $p$ is the corresponding function on disjoint unions

p_{\Gamma_1 \sqcup \Gamma_2} \;\coloneqq\; p_{\Gamma_1} \sqcup p_{\Gamma_2} \,.

The operation induces a pairing on the set $\mathcal{G}$ of isomorphism classes of finite multigraphs

(-) \sqcup (-) \;\colon\; \mathcal{G} \times \mathcal{G} \longrightarrow \mathcal{G} \,.

A finite multigraph $\Gamma \in \mathcal{G}$ (def. ) is called connected if it is not the disjoint union of two non-empty finite multigraphs.

We write

\mathcal{G}_{conn} \subset \mathcal{G}

for the subset of isomorphism classes of connected finite multigraphs.

Lemma

(Feynman amplitudes multiply under disjoint union of graphs)

Let

\Gamma \;=\; \Gamma_1 \sqcup \Gamma_2 \sqcup \cdots \sqcup \Gamma_n \;\in\; \mathcal{G}

be disjoint union of graphs (def. ). then then corresponding Feynman amplitudes (prop. ) multiply by the pointwise product (def. ):

\Gamma\left( g S_{int} + j A)_{i = 1}^{\nu(\Gamma)} \right) \;=\; \Gamma_1\left( (g S_{int} + j A)_{i = 1}^{\nu(\Gamma_1)}\right) \cdot \Gamma_2\left( (g S_{int} + j A)_{i = 1}^{\nu(\Gamma_2)} \right) \cdot \cdots \cdot \Gamma_n\left( (g S_{int} + j A)_{i = 1}^{\nu(\Gamma_n)} \right) \,.

Proof

By prop. the contributions to the S-matrix away from coinciding interaction points are given by the star product induced by the Feynman propagator, and specifically, by prop. , the Feynman amplitudes are given this way. Moreover the star product (def. ) is given by first contracting with powers of the Feynman propagator and then multiplying all resulting terms with the pointwise product of observables. This implies the claim by the nature of the combinatorial factor in the definition of the Feynman amplitudes (prop. ).

Definition

(effective action)

Let $(E_{\text{BV-BRST}}, \mathbf{L}', \Delta_H )$ be a relativistic free vacuum according to def. , let $\mathcal{S}$ be an S-matrix scheme for perturbative QFT around this vacuum (def. ) and let

g S_{int} + j A \;\in\; LocObs(E_{\text{BV-BRST}})[ [ \hbar, g, h ] ]

be a local observable.

Recall that for each finite multigraph $\Gamma \in \mathcal{G}$ (def. ) the Feynman perturbation series for $\mathcal{S}(g S_{int} + j A)$ (example )

\mathcal{S}(g S_{int} + j A) \;=\; \underset{\Gamma \in \mathcal{G}}{\sum} \Gamma\left( (g S_{int} + j A)_{i = 1}^{v(\Gamma)} \right)

contributes with a ("re"-)nromalized Feynman amplitude $\Gamma\left( (g S_{int} + j A)_{i = 1}^v\right) \in PolyObs(E_{\text{BV-BRST}})((\hbar))[ [ g, j ] ]$ .

We say that the corresponding effective action is $i \hbar$ times the sub-series

(247)

S_{eff}(g,j) \;\coloneqq\; i \hbar \underset{\Gamma \in \mathcal{G}_{conn}}{\sum} \Gamma\left( (g S_{int} + j A)_{i = 1}^{\nu(\Gamma)} \right) \;\in\; PolyObs(E_{\text{BV-BRST}})((\hbar))[ [ g, j ] ]

of Feynman amplitudes that are labeled only by the connected graphs $\Gamma \in \mathcal{G}_{conn} \subset \mathcal{G}$ (def. ).

(A priori $S_{eff}(g,j)$ could contain negative powers of $\hbar$ , but it turns out that it does not; this is prop. below.)

Remark

(terminology for “effective action”)

Beware differing conventions of terminology:

In the perspective of effective quantum field theory (remark below), the effective action in def. is sometimes called the effective potential at scale $\Lambda = 0$ (see prop. below).

This terminology originates in restriction to the special example of the scalar field (example ), where the non-derivative Phi^n interactions $g S_{int} = \underset{n}{\sum} \underset{\Sigma}{\int} g_{sw}^{(n)}(x) (\mathbf{\Phi}(x))^n \, dvol_\Sigma(x)$ (example ) are naturally thought of as potential energy-terms.

From this perspective the effective action in def. is a special case of relative effective actions $S_{eff,\Lambda}$ (“relative effective potentials”, in the case of Phi^n interactions) relative to an arbitrary UV cutoff-scales $\Lambda$ (def. below).
For the special case that

$j A \coloneqq \underset{\Sigma}{\int} j_{sw,a}(x) \mathbf{\Phi}^a(x)\, dvol_{\Sigma}(x)$

is a regular linear observable (def. ) the effective action according to def. is often denoted $W(j)$ or $E(j)$ , and then its functional Legendre transform (if that makes sense) is instead called the effective action, instead.

This is because the latter encodes the equations of motion for the vacuum expectation values $\langle \mathbf{\Phi}(x)_int\rangle$ of the interacting field observables; see example below.

Notice the different meaning of “effective” in both cases: In the first case it refers to what is effectively seen of the full pQFT at some UV-cutoff scale, while in the second case it refers to what is effectively seen when restricting attention only to the vacuum expectation values of regular linear observables.

Proposition

(effective action is logarithm of S-matrix)

Let $(E_{\text{BV-BRST}}, \mathbf{L}', \Delta_H )$ be a relativistic free vacuum according to def. , let $\mathcal{S}$ be an S-matrix scheme for perturbative QFT around this vacuum (def. ) and let

g S_{int} + j A \;\in\; LocObs(E_{\text{BV-BRST}})[ [ \hbar, g, h ] ]

be a local observable and let

S_{eff}(g,j) \;\in\; PolyObs(E_{\text{BV-BRST}})((\hbar))[ [ g, j] ]

be the corresponding effective action (def. ).

Then then S-matrix for $g S_{int} + j A$ is the exponential of the effective action with respect to the pointwise product $(-)\cdot (-)$ of observables (def. ):

\begin{aligned} \mathcal{S}(g S_{int} + j A) & = \exp_\cdot\left( \tfrac{1}{i \hbar} S_{eff}(g,j) \right) \\ & \coloneqq 1 + \frac{1}{i \hbar} S_{eff}(g,j) + \frac{1}{(i \hbar)^2} S_{eff}(g,j) \cdot S_{eff}(g,j) + \frac{1}{(i \hbar)^3} S_{eff}(g,j) \cdot S_{eff}(g,j) \cdot S_{eff}(g,j) + \cdots \end{aligned}

Moreover, this relation passes to the vacuum expectation values:

(248)

\begin{aligned} \left\langle {\, \atop \,} \mathcal{S}(g S_{int} + j A) {\, \atop \,} \right\rangle & = \left\langle {\, \atop \,} \exp\left( \tfrac{1}{i \hbar} S_{eff}(g,j) \right) {\, \atop \,} \right\rangle \\ & = e^{\tfrac{1}{i \hbar} \langle S_{eff}(g,j) \rangle} \end{aligned} \,.

Conversely the vacuum expectation value of the effective action is to the logarithm of that of the S-matrix:

\left\langle S_{eff}(g,j) \right\rangle \;=\; i \hbar \, \ln \left\langle \mathcal{S}(g S_{int} + j A) \right\rangle \,.

Proof

By lemma the summands in the $n$ th pointwise power of $\frac{1}{i \hbar}$ times the effective action are precisely the Feynman amplitudes $\Gamma\left((g S_{int} + j A)_{i = 1}^{\nu(\Gamma)}\right)$ of finite multigraphs $\Gamma$ with $n$ connected components, where each such appears with multiplicity given by the factorial of $n$ :

\frac{1}{n!} \left( \frac{1}{i \hbar} S_{eff}(g,j) \right)^n \;=\; \underset{ { \Gamma = \underoverset{j = 1}{n}{\sqcup} \Gamma_j } \atop { \Gamma_j \in \mathcal{G}_{conn} } }{\sum} \Gamma\left( (g S_{int} + j A)_{i = 1}^{\nu(\Gamma)} \right) \,.

It follows that

\begin{aligned} \exp_\cdot\left( \frac{1}{i \hbar} S_{int} \right) & = \underset{n \in \mathbb{N}}{\sum} \underset{ { \Gamma = \underoverset{j = 1}{n}{\sqcup} \Gamma_j } \atop { \Gamma_j \in \mathcal{G}_{conn} } }{\sum} \Gamma\left( (g S_{int} + j A)_{i = 1}^{v(\Gamma)} \right) \\ & = \underset{\Gamma \in \mathcal{G}}{\sum} \Gamma\left( (g S_{int} + j A)_{i = 1}^{v(\Gamma)} \right) \end{aligned}

yields the Feynman perturbation series by expressing it as a series (re-)organized by number of connected components of the Feynman diagrams.

To conclude the proof it is now sufficient to observe that taking vacuum expectation values of polynomial observables respects the pointwise product of observables

\left\langle A_1 \cdot A_2 \right\rangle \;=\; \left\langle A_1 \right\rangle \, \left\langle A_2 \right\rangle \,.

This is because the Hadamard vacuum state $\langle -\rangle \colon PolyObs(E_{\text{BV-BRST}})[ [ \hbar, g, j] ] \to \mathbb{C}[ [\hbar, g, j ] ]$ simply picks the zero-order monomial term, by prop. ), and under multiplication of polynomials the zero-order terms are multiplied.

This immediately implies the following important fact:

Proposition

(in stable vacuum the effective action is generating function for vacuum expectation values of interacting field observables)

If the given vacuum state is stable (def. ) then the vacuum expectation value $\langle S_{eff}(g,j)\rangle$ of the effective action (def. ) is the generating function for the vacuum expectation value of the interacting field observable $A_{int}$ (def. ) in that

\left\langle A_{int} \right\rangle \;=\; \frac{d}{d j} S_{eff}(g,j)\vert_{j = 0} \,.

Proof

We compute as follows:

\begin{aligned} \frac{d}{d j} S_{eff}(g,j) & = i \hbar \frac{d}{d j} \ln \left\langle \mathcal{S}(g S_{int} + j A) \right\rangle \vert_{j = 0} \\ & = i \hbar \left\langle \mathcal{S}(g S_{int}) \right\rangle^{-1} \frac{d}{d j} \left\langle \mathcal{S}(g S_{int} + j A) \right\rangle \vert_{j = 0} \\ & = \left\langle \frac{d}{d j} \underset{ \mathcal{Z}(j A) }{ \underbrace{\mathcal{S}(g S_{int})^{-1} \mathcal{S}(g S_{int} + j A) }} \vert_{j = 0} \right\rangle \\ & = \left\langle A_{int} \right\rangle \,. \end{aligned}

Here in the first step we used prop , in the second step we applied the chain rule of differentiation, in the third step we used the definition of vacuum stability (def. ) and in the fourth step we recognized the definition of the interacting field observables (def. ).

Example

(equations of motion for vacuum expectation values of interacting field observables)

Consider the effective action (def. ) for the case that

\begin{aligned} j A & = \tau{\Sigma}( j_{sw} \phi) \\ & = \underset{\Sigma}{\int} j_{sw}(x) \mathbf{\Phi}(x) \, dvol_\Sigma(x) \end{aligned}

is a regular linear observable (this def.), hence the smearing of a field observable (this def.) by an adiabatic switching of the source field

j_{sw} \;\in\; C^\infty_{cp}(\Sigma) \langle j\rangle \,.

(Here we are notationally suppressing internal field indices, for convenience.)

In this case the vacuum expectation value of the corresponding effective action is often denoted

W(j_{sw})

and regarded as a functional of the adiabatic switching $j_{sw}$ of the source field.

In this case prop. says that if the vacuum state is stable, then $W$ is the generating functional for interacting (def. ) field observables (def. ) in that

(249)

\left\langle \mathbf{\Phi}(x)_{int} \right\rangle \;=\; \frac{\delta}{\delta j_{sw}(x)} W(j_{sw} = 0) \,.

Assume then that there exists a corresponding functional $\Gamma(\Phi)$ of the field histories $\Phi \in \Gamma_{\Sigma}(E_{\text{BV-BRST}})$ (def. ), which behaves like a functional Legendre transform of $W$ in that it satisfies the functional version of the defining equation of Legendre transforms (first derivatives are inverse functions of each other, see this equation):

\frac{\delta }{\delta \Phi(x)} \Gamma \left( \frac{\delta}{\delta j_{sw}(y)} W \right) \;=\; \delta(x,y) j_{sw}(x) \,.

By (249) this implies that

\frac{\delta }{\delta \Phi(x)} \Gamma \left( \left\langle \mathbf{\Phi}(x)_{int} \right\rangle \right) \;=\; 0 \,.

This may be read as a quantum version of the principle of extremal action (prop. ) formulated now not for the field histories $\Phi(x)$ , but for the vacuum expectation values $\langle \mathbf{\Phi}(x)_{int}\rangle$ of their corresponding interacting quantum field observables.

Beware, (as in remark ) that many texts refer to $\Gamma(\Phi)$ as the effective action, instead of its Legendre transform, the generating functional $W(j_{sw})$ .

The perspective of the effective action gives a transparent picture of the order of quantum effects involved in the S-matrix, this is prop. below. In order to state this conveniently, we invoke two basic concepts from graph theory:

Definition

(planar graphs and trees)

A finite multigraph (def. ) is called a planar graph if it admits an embedding into the plane, hence if it may be “drawn into the plane” without intersections, in the evident way.

A finite multigraph is called a tree if for any two of its vertices there is at most one path of edges connecting them, these are examples of planar graphs. We write

\mathcal{G}_{tree} \subset \mathcal{G}

for the subset of isomorphism classes of finite multigraphs with linearly orrdered vertices (def. ) on those which are trees.

Proposition

(loop order and tree level of Feynman perturbation series)

The effective action (def. ) contains no negative powers of $\hbar$ , hence is indeed a formal power series also in $\hbar$ :

S_{eff}(g,j) \;\in\; PolyObs(E_{\text{BV-BRST}})[ [ \hbar, g, j ] ] \,.

and in particular

\left\langle S_{eff}(g,j) \right\rangle \;\in\; \mathbb{C}[ [ \hbar, g, j] ] \,.

Moreover, the contribution to the effective action in the classical limit $\hbar \to 0$ is precisely that of Feynman amplitudes of those finite multigraphs (prop. ) which are trees (def. ); thus called the tree level-contribution:

S_{eff}(g,j)\vert_{\hbar = 0} \;=\; i \hbar \underset{\Gamma \in \mathcal{G}_{conn} \cap \mathcal{G}_{tree}}{\sum} \Gamma\left( (g S_{int} + j A)_{i = 1}^{\nu(\Gamma)} \right) \,.

Finally, a finite multigraph $\Gamma$ (def. ) which is planar (def. ) and connected (def. ) contributes to the effective action precisely at order

\hbar^{L(\Gamma)} \,,

where $L(\Gamma) \in \mathbb{N}$ is the number of faces of $\Gamma$ , here called the number of loops of the diagram; here usually called the loop order of $\Gamma$ .

(Beware the terminology clash with graph theory, see the discussion of tadpoles in remark .)

Proof

By def. the explicit $\hbar$ -dependence of the S-matrix is

\mathcal{S} \left( S_{int} \right) \;=\; \underset{k \in \mathbb{N}}{\sum} \frac{1}{k!} \frac{1}{(i \hbar)^k} T( \underset{k \, \text{factors}}{\underbrace{S_{int}, \cdots, S_{int}}} )

and by prop. the further $\hbar$ -dependence of the time-ordered product $T(\cdots)$ is

T(S_{int}, S_{int}) \;=\; prod \circ \exp\left( \hbar \left\langle \Delta_F, \frac{\delta}{\delta \mathbf{\Phi}} \otimes \frac{\delta}{\delta \mathbf{\Phi}} \right\rangle \right) ( S_{int} \otimes S_{int} ) \,,

By the Feynman rules (prop. ) this means that

each vertex of a Feynman diagram contributes a power $\hbar^{-1}$ to its Feynman amplitude;
each edge of a Feynman diagram contributes a power $\hbar^{+1}$ to its Feynman amplitude.

If we write

E(\Gamma), V(\Gamma) \;\in\; \mathbb{N}

for the total number of vertices and edges, respectively, in $\Gamma$ , this means that a Feynman amplitude corresponding to some $\Gamma \in \mathcal{G}$ contributes precisely at order

(250)

\hbar^{E(\Gamma) - V(\Gamma)} \,.

So far this holds for arbitrary $\Gamma$ . If however $\Gamma$ is connected (def. ) and planar (def. ), then Euler's formula asserts that

(251)

E(\Gamma) - V(\Gamma) \;=\; L(\Gamma) - 1 \,.

Hence $\hbar^{L(\Gamma)- 1}$ is the order of $\hbar$ at which $\Gamma$ contributes to the scattering matrix expressed as the Feynman perturbation series.

But the effective action, by definition (247), has the same contributions of Feynman amplitudes, but multiplied by another power of $\hbar^1$ , hence it contributes at order

\hbar^{E(\Gamma) - V(\Gamma) + 1} = \hbar^{L(\Gamma)} \,.

This proves the second claim on loop order.

The first claim, due to the extra factor of $\hbar$ in the definition of the effective action, is equivalent to saying that the Feynman amplitude of every connected finite multigraph contributes powers in $\hbar$ of order $\geq -1$ and contributes at order $\hbar^{-1}$ precisely if the graph is a tree.

Observe that a connected finite multigraph $\Gamma$ with $\nu \in \mathbb{N}$ vertices (necessarily $\nu \geq 1$ ) has at least $\nu-1$ edges and precisely $\nu - 1$ edges if it is a tree.

To see this, consecutively remove edges from $\Gamma$ as long as possible while retaining connectivity. When this process stops, the result must be a connected tree $\Gamma'$ , hence a connected planar graph with $L(\Gamma') = 0$ . Therefore Euler's formula (251) implies that that $E(\Gamma') = V(\Gamma') -1$ .

This means that the connected multigraph $\Gamma$ in general has a Feynman amplitude of order

\hbar^{E(\Gamma) - V(\Gamma)} = \hbar^{ \overset{\geq 0}{\overbrace{E(\Gamma) - E(\Gamma')}} + \overset{= -1}{\overbrace{E(\Gamma') - V(\Gamma)}} }

and precisely if it is a tree its Feynman amplitude is of order $\hbar^{-1}$ .

$\,$

Vacuum diagrams

With the Feynman perturbation series and the effective action in hand, it is now immediate to see that there is a general contribution by vacuum diagrams (def. below) in the scattering matrix which, in a stable vacuum state, cancels out against the prefactor $\mathcal{S}(g S_{int})$ in Bogoliubov's formula for interacting field observables.

Definition

(vacuum diagrams)

Then a Feynman diagram all whose vertices are internal vertices (def. ) is called a vacuum diagram.

Write

\mathcal{G}^{Feyn}_{vac} \subset \mathcal{G}^{Feyn}

for the subset of isomorphism classes of vacuum diagrams among the set of isomorphism classes of all Feynman diagrams, def. . Similarly write

\mathcal{G}^{Feyn}_{conn,vac} \;\coloneqq\; \mathcal{G}^{Feyn}_{conn} \cap \mathcal{G}^{Feyn}_{vac} \;\subset\; \mathcal{G}^{Feyn}

for the subset of isomorphism classes of Feynman diagrams which are both vacuum diagrams as well as connected graphs (def. ).

Finally write

S_{eff,vac}(g) \;\coloneqq\; \underset{ { (\Gamma,vertlab,edgelab) } \atop { \in \mathcal{G}_{conn,vac} } }{\sum} (\Gamma,vertlab, edgelab)(g S_{int}) \;\in\; PolyObs(E_{\text{BV-BRST}})_{mc}[ [ \hbar , g ] ]

for the sub-series of that for the effective action (def. ) given only by those connected diagrams which are also vacuum diagrams.

Example

(2-vertex vacuum diagram in QED)

The vacuum diagram (def. ) with two electron-photon interaction-vertices in quantum electrodynamics (example ) is:

Example

(vacuum diagram-contribution to S-matrices)

Then the Feynman perturbation series-expansion of the S-matrix (example ) of the interaction-term $g S_{int}$ alone (no source field-contribution) is the series of Feynman amplitudes that are labeled by vacuum diagrams (def. ), hence (by prop. ) the exponential of the vacuum effective action $S_{eff,vac}$ (def. ):

\begin{aligned} \mathcal{S}(g S_{int}) & = \exp_\cdot\left( \tfrac{1}{i \hbar} S_{eff,vac}(g,j) \right) \\ & = \underset{\Gamma \in \mathcal{G}_{vac}}{\sum} \Gamma\left(g S_{int}\right) \end{aligned} \,.

More generally, the S-matrix with source field-contribution $j A$ included always splits as a pointwise product of the vacuum S_matrix with the Feynman perturbation series over all Feynman graphs with at least one external vertex:

\begin{aligned} \mathcal{S}(g S_{int} + j A) \;=\; \mathcal{S}(g S_{int}) \cdot \underset{ \text{Feynman perturbation series} \atop \text{over diagrams with at least one external vertex} }{ \underbrace{ \exp_\cdot \left( \tfrac{1}{i \hbar} \left( S_{eff}(g,j) - S_{eff,vac}(g) \right) \right) } } \,, \end{aligned}

Hence if the free field vacuum state is stable with respect to the interaction $g S_{int}$ , according to def. , then the vacuum expectation value of a time-ordered product of interacting field observables $j (A_i)_{int}$ (example ) and hence in particular of scattering amplitudes (example ) is given by the Feynman perturbation series (example ) over just the non-vacuum Feynman diagrams, hence over all those diagram that have at least one one external vertex

\begin{aligned} & \left( {\, \atop \,} supp(A_1) {\vee\!\!\!\wedge} supp(A_2) {\vee\!\!\!\wedge} \cdots {\vee\!\!\!\wedge} supp(A_n) {\, \atop \,} \right) \\ & \Rightarrow \left\langle {\, \atop \,} (A_1)_int (A_2)_{int} \cdots (A_n)_{int} {\, \atop \,} \right\rangle \;=\; \frac{d^n}{ d j_1 \cdots d j_n} \left( \underset{\Gamma \in \mathcal{G} \setminus \mathcal{G}_{vac} }{\sum} \Gamma(g S_{int} + \sum_i j_i A_i) \right)_{ \vert j_1, \cdots, j_n = 0 } \,. \end{aligned}

This is the way in which the Feynman perturbation series is used in practice for computing scattering amplitudes.

$\,$

Interacting quantum BV-Differential

So far we have discussed, starting with a BV-BRST gauge fixed free field vacuum, the perturbative construction of interacting field algebras of observables (def. ) and their organization in increasing powers of $\hbar$ and $g$ (loop order, prop. ) via the Feynman perturbation series (example , example ).

But this interacting field algebra of observables still involves all the auxiliary fields of the BV-BRST gauge fixed free field vacuum (example ), while the actual physical gauge invariant on-shell observables should be (just) the cochain cohomology of the BV-BRST differential on this enlarged space of observables. Hence for the construction of perturbative QFT to conclude, it remains to pass the BV-BRST differential of the free field Wick algebra of observables to a differential on the interacting field algebra, such that its cochain cohomology is well defined.

Since the time-ordered products away from coinciding interaction points and as well as on regular polynomial observables are uniquely fixed (prop. ), one finds that also this interacting quantum BV-differential is uniquely fixed, on regular polynomial observables, by conjugation with the quantum Møller operators (def. ). The formula that characterizes it there is called the quantum master equation or equivalently the quantum master Ward identity (prop. below).

When extending to coinciding interaction points via ("re"-)normalization (def. ) these identities are not guaranteed to hold anymore, but may be imposed as renormalization conditions (def. , prop. ). Quantum correction to the master Ward identity then imply corrections to Noether current conservation laws; this we discuss below.

$\,$

For the following discussion, recall from the previous chapter how the global BV-differential

\{S',-\} \;\colon\; PolyObs(E_{\text{BV-BRST}})_{reg}[ [\hbar] ] \longrightarrow PolyObs(E_{\text{BV-BRST}})_{reg}[ [\hbar] ]

on regular polynomial observables (def. ) as well as the global antibracket $\{-,-\}$ (def. ) are conjugated into the time-ordered product via the time ordering operator $\mathcal{T} \circ \{-S',-\} \circ \mathcal{T}^{-}$ (def. , prop. ), which makes

In the same way we may use the quantum Møller operators to conjugate the BV-differential into the regular part of the interacting field algebra of observables:

Definition

(interacting quantum BV-differential)

Let $(E_{\text{BV-BRST}}, \mathbf{L}', \Delta_H )$ be a relativistic free vacuum according to def. and let

S_{int} \;\in\; PolyObs(E_{\text{BV-BRST}})_{reg}[ [ \hbar, g, j] ]

be a regular polynomial observables, regarded as an adiabatically switched non-point-interaction action functional.

Then the interacting quantum BV-differential on the interacting field algebra on regular polynomial observables (def. ) is the conjugation of the plain global BV-differential $\{-S',-\}$ (def. ) by the quantum Møller operator induced by $S_{int}$ (def. ):

\mathcal{R} \circ \{-S', (-)\} \circ \mathcal{R}^{-1} \;\colon\; PolyObs(E_{\text{BV-BRST}})_{reg}[ [\hbar] ] \longrightarrow PolyObs(E_{\text{BV-BRST}})_{reg}[ [\hbar] ] \,.

(Rejzner 11, (5.38))

Proposition

(quantum master equation and quantum master Ward identity on regular polynomial observables)

Consider an adiabatically switched non-point-interaction action functional in the form of a regular polynomial observable in degree 0

S_{int} \;\in\; PolyObs(E_{\text{BV-BRST}})_{{reg} \atop {deg = 0}}[ [\hbar] ] \,,

Then the following are equivalent:

The quantum master equation (QME)

(252) $\tfrac{1}{2} \{ S' + S_{int}, S' + S_{int} \}_{\mathcal{T}} + i \hbar \Delta_{BV}( S' + S_{int} ) \;=\; 0 \,.$
The perturbative S-matrix (def. ) is $BV$ -closed

$\{-S', \mathcal{S}(S_{int})\} = 0 \,.$
The quantum master Ward identity (MWI) on regular polynomial observables in terms of retarded products:

(253) $\mathcal{R} \circ \{-S',(-)\} \circ \mathcal{R}^{-1} \;=\; - \left( \left\{ S' + S_{int} \,,\, (-) \right\}_{\mathcal{T}} + i \hbar \Delta_{BV} \right)$

(Dütsch 18, (4.2))

expressing the interacting quantum BV-differential (def. ) as the sum of the time-ordered antibracket (def. ) with the total action functional $S' + S_{int}$ and $i \hbar$ times the BV-operator (BV-operator).
The quantum master Ward identity (MWI) on regular polynomial observables in terms of time-ordered products:

(254) $\mathcal{S}(-S_{int}) \star_F \{-S', \mathcal{S}(S_{int}) \star_F (-)\} \;=\; - \left( \left\{ S' + S_{int} \,,\, (-) \right\}_{\mathcal{T}} + i \hbar \Delta_{BV} \right)$

(Dütsch 18, (4.8))

(Rejzner 11, (5.35) - (5.38), following Hollands 07, (342)-(345))

Proof

To see that the first two conditions are equivalent, we compute as follows

(255)

\begin{aligned} \left\{ -S', \mathcal{S}(S_{int}) \right\} & = \left\{ -S' , \exp_{\mathcal{T}} \left( \tfrac{1}{i \hbar} S_{int} \right) \right\} \\ & = \underset{ { \tfrac{-1}{i \hbar} \{S',S\}_{\mathcal{T}} } \atop { \star_F \exp_{\mathcal{T}} \left( \tfrac{1}{i \hbar} S_{int} \right) } }{ \underbrace{ \left\{ -S' , \exp_{\mathcal{T}} \left( \tfrac{1}{i \hbar} S_{int} \right) \right\}_{\mathcal{T}} } } - i \hbar \underset{ { \left( \tfrac{1}{i \hbar} \Delta_{BV}(S_{int}) + \tfrac{1}{2 (i \hbar)^2} \left\{ S_{int}, S_{int} \right\}_{\mathcal{T}} \right) } \atop { \star_{F} \exp_{\mathcal{T}} \left( \tfrac{1}{i \hbar} S_{int} \right) } }{ \underbrace{ \Delta_{BV} \left( \exp_{\mathcal{T}} \left( \tfrac{1}{i \hbar} S_{int} \right) \right) } } \\ & = \tfrac{-1}{i \hbar} \underset{ \text{QME} }{ \underbrace{ \left( \{S',S_{int}\} + \tfrac{1}{2}\{S_{int}, S_{int}\} + i \hbar \Delta_{BV}(S_{int}) \right) } } \star_F \exp_{\mathcal{T}} \left( \tfrac{1}{i \hbar} S_{int} \right) \end{aligned}

Here in the first step we used the definition of the BV-operator (def. ) to rewrite the plain antibracket in terms of the time-ordered antibracket (def. ), then under the second brace we used that the time-ordered antibracket is the failure of the BV-operator to be a derivation (prop. ) and under the first brace the consequence of this statement for application to exponentials (example ). Finally we collected terms, and to “complete the square” we added the terms on the left of

\frac{1}{2} \underset{= 0}{\underbrace{\{S', S'\}_{\mathcal{T}}}} - i \hbar \underset{ = 0}{\underbrace{ \Delta_{BV}(S')}} = 0

which vanish because, by definition of gauge fixing (def. ), the free gauge-fixed action functional $S'$ is independent of antifields.

But since the operation $(-) \star_F \exp_{\mathcal{T}}\left( \tfrac{1}{i \hbar} S_{int} \right)$ has the inverse $(-) \star_F \exp_{\mathcal{T}}\left( \tfrac{-1}{i \hbar} S_{int} \right)$ , this implies the claim.

Next we show that the quantum master equation implies the quantum master Ward identities.

We use that the BV-differential $\{-S',-\}$ is a derivation of the Wick algebra product $\star_H$ (lemma ).

First of all this implies that with $\{-S', \mathcal{S}(S_{int})\} = 0$ also $\{-S', \mathcal{S}(S_{int})^{-1}\} = 0$ .

Thus we compute as follows:

\begin{aligned} \{-S', -\} \circ \mathcal{R}^{-1}(A) & = \{-S', \mathcal{R}^{-1}(A)\} \\ & = \left\{ { \, \atop \, } -S', \mathcal{S}(S_{int})^{-1} \star_H \left( \mathcal{S}(S_{int}) \star_F a \right) {\, \atop \,} \right\} \\ & = \phantom{+} \underset{ = 0 }{ \underbrace{ \left\{ -S', \mathcal{S}(S_{int})^{-1} \right\} } } \star_H \left( \mathcal{S}(S_{int}) \star_F A \right) \\ & \phantom{=} + \mathcal{S}(S_{int})^{-1} \star_H \left\{ -S', \mathcal{S}(S_{int}) \star_F A \right\} \\ & = \mathcal{S}(S_{int})^{-1} \star_H \left( \underset{ = 1 }{ \underbrace{ \mathcal{S}(+ S_{int}) \star_F \mathcal{S}(- S_{int}) } } \star_F \left\{ -S', \mathcal{S}(S_{int}) \star_F A \right\} \right) \\ & = \mathcal{S}(S_{int})^{-1} \star_H \left( \mathcal{S}(+ S_{int}) \star_F \underset{ (\ast) }{ \underbrace{ \mathcal{S}(- S_{int}) \star_F \left\{ -S', \mathcal{S}(S_{int}) \star_F A \right\} } } \right) \\ & = \mathcal{R}^{-1} \left( \underset{ (\ast) }{ \underbrace{ \phantom{\, \atop \,} \mathcal{S}(-S_{int}) \star_F \left\{ -S', \mathcal{S}(S_{int}) \star_F A \right\} } } \right) \end{aligned}

By applying $\mathcal{R}$ to both sides of this equation, this means first of all that the interacting quantum BV-differential is equivalently given by

\mathcal{R} \circ \{-S', (-)\} \circ \mathcal{R}^{-1} \;=\; \mathcal{S}(-S_{int}) \star_F \{-S', \mathcal{S}(S_{int}) \star_F (-)\} \,,

hence that if either version (253) or (257) of the master Ward identity holds, it implies the other.

Now expanding out the definition of $\mathcal{S}$ (def. ) and expressing $\{-S',-\}$ via the time-ordered antibracket (def. ) and the BV-operator $\Delta_{BV}$ (prop. ) as

\{-S',-\} \;=\; \{-S',-\}_{\mathcal{T}} - i \hbar \Delta_{BV}

(on regular polynomial observables), we continue computing as follows:

(256)

\begin{aligned} & \mathcal{R} \circ \{-S', (-)\} \circ \mathcal{R}^{-1}( A ) \\ & = \exp_{\mathcal{T}} \left( \tfrac{-1}{i \hbar} S_{int} \right) \star_F \left\{ -S', \exp_{\mathcal{T}} \left( \tfrac{1}{i \hbar} S_{int} \right) \star_F A \right\} \\ & = \exp_{\mathcal{T}} \left( \tfrac{-1}{i \hbar} S_{int} \right) \star_F \left( \left\{ -S', \exp_{\mathcal{T}} \left( \tfrac{ 1 }{i \hbar} S_{int} \right) \star_F A \right\}_{\mathcal{T}} - i \hbar \Delta_{BV} \left( \exp_{\mathcal{T}} \left( \tfrac{ 1 }{i \hbar} S_{int} \right) \star_F A \right) \right) \\ & \phantom{+} = \tfrac{1}{i \hbar} \{ -S', S_{int} \}_{\mathcal{T}} \star_F A + \{-S', A\}_{\mathcal{T}} \\ & \phantom{=} - i \hbar \exp_{\mathcal{T}}\left( \tfrac{-1}{i \hbar} S_{int}\right) \star_F \left( \underset{ { \left( \tfrac{1}{i \hbar}\Delta_{BV}(S_{int}) + \tfrac{1}{2 (i \hbar)^2} \left\{ S_{int}, S_{int} \right\} \right) } \atop { \star_F \exp_{\mathcal{T}}\left( \tfrac{ 1 }{i \hbar} S_{int} \right) } }{ \underbrace{ \Delta_{BV} \left( \exp_{\mathcal{T}} \left( \tfrac{ 1}{i \hbar} S_{int} \right) \right) } } \star_F A \,+\, \exp_{\mathcal{T}} \left( \tfrac{ 1}{i \hbar} S_{int} \right) \star_F \Delta_{BV}(A) \,+\, \underset{ { \exp_{\mathcal{T}}\left( \tfrac{1}{i \hbar} S_{int} \right) } \atop { \star_F \tfrac{ 1}{i \hbar} \{S_{int}, A\} } }{ \underbrace{ \left\{ \exp_{\mathcal{T}} \left( \tfrac{ 1}{i \hbar} S_{int} \right) \,,\, A \right\}_{\mathcal{T}} } } \right) \\ & = - \left( \{ S' + S_{int}\,,\, A\}_{\mathcal{T}} + i \hbar \Delta_{BV}(A) \right) \\ & \phantom{=} - \tfrac{1}{i \hbar} \underset{ \text{QME} }{ \underbrace{ \left( \tfrac{1}{2} \{ S' + S_{int}, S' + S_{int} \}_{\mathcal{T}} + i \hbar \Delta_{BV}( S' + S_{int} ) \right) }} \star_F A \\ & = - \left( \{ S' + S_{int}\,,\, A\}_{\mathcal{T}} + i \hbar \Delta_{BV}(A) \right) \end{aligned}

Here in the line with the braces we used that the BV-operator is a derivation of the time-ordered product up to correction by the time-ordered antibracket (prop. ), and under the first brace we used the effect of that property on time-ordered exponentials (example ), while under the second brace we used that $\{(-),A\}_{\mathcal{T}}$ is a derivation of the time-ordered product. Finally we have collected terms, added $0 = \{S',S'\} + i \hbar \Delta_{BV}(S')$ as before, and then used the QME.

This shows that the quantum master Ward identities follow from the quantum master equation. To conclude, it is now sufficient to show that, conversely, the MWI in terms of, say, retarded products implies the QME.

To see this, observe that with the BV-differential being nilpotent, also its conjugation by $\mathcal{R}$ is, so that with the above we have:

\begin{aligned} & \left( \{-S',-\}\right)^2 = 0 \\ \Leftrightarrow \; & \left( \mathcal{R} \circ \{-S',(-)\} \circ \mathcal{R}^{-1} \right)^2 = 0 \\ \Leftrightarrow \; & \underset{ \left\{ {\, \atop \,} \tfrac{1}{2}\{S' + S_{int}, S' + S_{int}\}_{\mathcal{T}} + i \hbar \Delta_{BV}(S' + S_{int}) \,,\, (-) \right\} }{ \underbrace{ \left( \{S' + S_{int}, (-)\}_{\mathcal{T}} + i \hbar \Delta_{BV} \right)^2 } } = 0 \end{aligned}

Here under the brace we computed as follows:

\begin{aligned} \left( \{S' + S_{int}, (-)\}_{\mathcal{T}} + i \hbar \Delta_{BV} \right)^2 & = \phantom{+} \underset{ \tfrac{1}{2} \{ \{S' + S, S'+ S\}_{\mathcal{T}}, (-) \}_{\mathcal{T}} }{ \underbrace{ \{S' + S_{int}, \{S' + S_{int}\}_{\mathcal{T}}, (-) \}_{\mathcal{T}} }} \\ & \phantom{=} + i \hbar \underset{ \{ \Delta_{BV}(S'+ S)\,,\, (-) \}_{\mathcal{T}} }{ \underbrace{ \left( \{S' + S_{int}, (-)\}_{\mathcal{T}} \circ \Delta_{BV} + \Delta_{BV} \circ \{S' + S_{int}, (-)\}_{\mathcal{T}} \right) }} \\ & \phantom{=} + (i \hbar)^2 \underset{= 0} { \underbrace{ \Delta_{BV} \circ \Delta_{BV} } } \end{aligned} \,.

where, in turn, the term under the first brace follows by the graded Jacobi identity, the one under the second brace by Henneaux-Teitelboim (15.105c) and the one under the third brace by Henneaux-Teitelboim (15.105b).

$\,$

Ward identities

The quantum master Ward identity (prop. ) expresses the relation between the quantum (measured by Planck's constant $\hbar$ ) interacting (measured by the coupling constant $g$ ) equations of motion to the classical free field equations of motion at $\hbar, g\to 0$ (remark below). As such it generalizes the Schwinger-Dyson equation (prop. ), to which it reduces for $g = 0$ (example below) as well as the classical master Ward identity, which is the case for $\hbar = 0$ (example below).

Applied to products of the equations of motion with any given observable, the master Ward identity becomes a particular Ward identity.

This is of interest notably in view of Noether's theorem (prop. ), which says that every infinitesimal symmetry of the Lagrangian of, in particular, the given free field theory, corresponds to a conserved current (def. ), hence a horizontal differential form whose total spacetime derivative vanishes up to a term proportional to the equations of motion. Under transgression to local observables this is a relation of the form

div \mathbf{J} = 0 \phantom{AAA} \text{on-shell} \,,

where “on shell” means up to the ideal generated by the classical free equations of motion. Hence for the case of local observables of the form $div \mathbf{J}$ , the quantum Ward identity expresses the possible failure of the original conserved current to actually be conserved, due to both quantum effects ( $\hbar$ ) and interactions ( $g$ ). This is the form in which Ward identities are usually understood (example below).

As one extends the time-ordered products to coinciding interaction points in ("re"-)normalization of the perturbative QFT (def. ), the quantum master equation/master Ward identity becomes a renormalization condition (def. , prop. ). If this condition fails, one speaks of a quantum anomaly. Specifically if the Ward identity for an infinitesimal gauge symmetry is violated, one speaks of a gauge anomaly.

$\,$

Definition

Consider a free gauge fixed Lagrangian field theory $(E_{\text{BV-BRST}}, \mathbf{L}')$ (def. ) with global BV-differential on regular polynomial observables

\{-S',(-)\} \;\colon\; PolyObs(E_{\text{BV-BRST}})_{reg}[ [\hbar] ] \longrightarrow PolyObs(E_{\text{BV-BRST}})_{reg}[ [\hbar] ]

(def. ).

Let moreover

g S_{int} \;\in\; PolyObs(E_{\text{BV-BRST}})_{reg}[ [ \hbar , g ] ]

be a regular polynomial observable (regarded as an adiabatically switched non-point-interaction action functional) such that the total action $S' + g S_{int}$ satisfies the quantum master equation (prop. ); and write

\mathcal{R}^{-1}(-) \;\coloneqq\; \mathcal{S}(g S_{int})^{-1} \star_H (\mathcal{S}(g S_{int}) \star_F (-))

for the corresponding quantum Møller operator (def. ).

Then by prop. we have

(257)

\{-S',(-)\} \circ \mathcal{R}^{-1} \;=\; \mathcal{R}^{-1} \left(\left\{ -(S' + g S_{int}) \,,\, (-) \right\}_{\mathcal{T}} -i \hbar \Delta_{BV}\right)

This is the quantum master Ward identity on regular polynomial observables, i.e. before renormalization.

(Rejzner 13, (37))

Remark

(quantum master Ward identity relates quantum interacting field EOMs to classical free field EOMs)

For $A \in PolyObs(E_{\text{BV-BRST}})_{reg}[ [ \hbar, g] ]$ the quantum master Ward identity on regular polynomial observables (257) reads

(258)

\mathcal{R}^{-1} \left( \left\{ -(S' + g S_{int}) \,,\, A \right\}_{\mathcal{T}} - i \hbar \Delta_{BV}(A) \right) \;=\; \{-S', \mathcal{R}^{-1}(A) \}

The term on the right is manifestly in the image of the global BV-differential $\{-S',-\}$ of the free field theory (def. ) and hence vanishes when passing to on-shell observables along the isomorphism (198)

\underset{ \text{on-shell} }{ \underbrace{ PolyObs(E_{\text{BV-BRST}}, \mathbf{L}') }} \;\simeq\; \underset{ \text{off-shell} }{ \underbrace{ PolyObs(E_{\text{BV-BRST}})_{def(af = 0)} }}/im(\{-S',-\})

(by example ).

Hence

\mathcal{R}^{-1} \left( \left\{ -(S' + g S_{int}) \,,\, A \right\}_{\mathcal{T}} - i \hbar \Delta_{BV}(A) \right) \;=\; 0 \phantom{AAA} \text{on-shell}

In contrast, the left hand side is the interacting field observable (via def. ) of the sum of the time-ordered antibracket with the action functional of the interacting field theory and a quantum correction given by the BV-operator. If we use the definition of the BV-operator $\Delta_{BV}$ (def. ) we may equivalently re-write this as

(259)

\mathcal{R}^{-1} \left( \left\{ -S' \,,\, A \right\} + \left\{ -g S_{int} \,,\, A \right\}_{\mathcal{T}} \right) \;=\; 0 \phantom{AAA} \text{on-shell}

Hence the quantum master Ward identity expresses a relation between the ideal spanned by the classical free field equations of motion and the quantum interacting field equations of motion.

Example

(free field-limit of master Ward identity is Schwinger-Dyson equation)

In the free field-limit $g \to 0$ (noticing that in this limit $\mathcal{R}^{-1} = id$ ) the quantum master Ward identity (257) reduces to

\left\{ -S' \,,\, A \right\}_{\mathcal{T}} - i \hbar \Delta_{BV}(A) \;=\; \{-S', A \}

which is the defining equation for the BV-operator (221), hence is isomorphic (under $\mathcal{T}$ ) to the Schwinger-Dyson equation (prop. )

Example

(classical limit of quantum master Ward identity)

In the classical limit $\hbar \to 0$ (noticing that the classical limit of $\{-,-\}_{\mathcal{T}}$ is $\{-,-\}$ ) the quantum master Ward identity (257) reduces to

\mathcal{R}^{1} \left( \left\{ -(S' + g S_{int}) \,,\, A \right\} \right) \;=\; \{-S', \mathcal{R}^{-1}(A) \}

This says that the interacting field observable corresponding to the global antibracket with the action functional of the interacting field theory vanishes on-shell, classically.

Applied to an observable which is linear in the antifields

A \;=\; \underset{\Sigma}{\int} A^a(x) \mathbf{\Phi}^\ddagger_a(x) \, dvol_\Sigma(x)

this yields

\begin{aligned} 0 & = \{-S', \mathcal{R}^{-1}(A)\} + \mathcal{R}^{-1} \left( \left\{ -(S' + S_{int}) \,,\, A \right\}_{\mathcal{T}} \right) \\ & = \underset{\Sigma}{\int} \frac{\delta S'}{\delta \mathbf{\Phi}^a(x)} \mathcal{R}^{-1}(A^a(x)) \, dvol_\Sigma(x) + \mathcal{R}^{-1} \left( \underset{\Sigma}{\int} A^a(x) \frac{\delta (S' + S_{int})}{\delta \mathbf{\Phi}^a(x)} \, dvol_\Sigma(x) \right) \end{aligned}

This is the classical master Ward identity according to (Dütsch-Fredenhagen 02, Brennecke-Dütsch 07, (5.5)), following (Dütsch-Boas 02).

Example

(quantum correction to Noether current conservation)

Let $v \in \Gamma^{ev}_\Sigma(T_\Sigma(E_{\text{BRST}}))$ be an evolutionary vector field, which is an infinitesimal symmetry of the Lagrangian $\mathbf{L}'$ , and let $J_{\hat v} \in \Omega^{p,0}_\Sigma(E_{\text{BV-BRST}})$ the corresponding conserved current, by Noether's theorem I (prop. ), so that

\begin{aligned} d J_{\hat v} & = \iota_{\hat v} \delta \mathbf{L}' \\ & = (v^a dvol_\Sigma) \frac{\delta_{EL} L'}{\delta \phi^a} \phantom{AAA} \in \Omega^{p+1,0}_\Sigma(E_{\text{BV-BRST}}) \end{aligned}

by (80), where in the second line we just rewrote the expression in components (50)

v^a \,, \frac{\delta_{EL} L'}{\delta \phi^a} \;\in \Omega^{0,0}_\Sigma(E_{\text{BV-BRST}})

and re-arranged suggestively.

Then for $a_{sw} \in C^\infty_{cp}(\Sigma)$ any choice of bump function, we obtain the local observables

\begin{aligned} A_{sw} & \coloneqq \underset{\Sigma}{\int} \underset{ A^a(x) }{ \underbrace{ a_{sw}(x) v^a( \mathbf{\Phi}(x), D\mathbf{\Phi}(x), \cdots ) } } \mathbf{\Phi}^\ddagger_a(x) \, dvol_\Sigma(x) \\ & \coloneqq \tau_\Sigma( a_{sw} v^a \phi^{\ddagger}_a \, dvol_\Sigma) \end{aligned}

and

\begin{aligned} (div \mathbf{J})_{sw} & \coloneqq \underset{\Sigma}{\int} \underset{ A^a(x) }{ \underbrace{ a_{sw}(x) v^a( \mathbf{\Phi}(x), D\mathbf{\Phi}(x), \cdots ) } } \frac{\delta S'}{\delta \mathbf{\Phi}^a(x)} \, dvol_\Sigma(x) \\ & \coloneqq \tau_\Sigma \left( a_{sw} v^a \frac{\delta_{EL} \mathbf{L}'}{\delta \phi^a} \, dvol_\Sigma \right) \end{aligned}

by transgression of variational differential forms.

This is such that

\left\{ -S' , A_{sw} \right\} = (div \mathbf{J})_{sw} \,.

Hence applied to this choice of local observable $A$ , the quantum master Ward identity (259) now says that

\mathcal{R}^{-1} \left( {\, \atop \,} (div \mathbf{J})_{sw} \right) \;=\; \mathcal{R}^{-1} \left( {\, \atop \,} \{g S_{int}, A_{sw} \}_{\mathcal{T}} {\, \atop \,} \right) \phantom{AAA} \text{on-shell}

Hence the interacting field observable-version $\mathcal{R}^{-1}(div\mathbf{J})$ of $div \mathbf{J}$ need not vanish itself on-shell, instead there may be a correction as shown on the right.

$\,$

This concludes our discussion of perturbative quantum observables of interacting field theories. In the next chapter wé discuss explicitly the inductive construction via ("re"-)normalization of time-ordered products/Feynman amplitudes as well as the various incarnations of the re-normalization group passing between different choices of such ("re"-)normalizations.

Renormalization

In this chapter we discuss the following topics:

Epstein-Glaser normalization
Stückelberg-Petermann re-normalization
UV-Regularization via Counterterms
Wilson-Polchinski effective QFT flow
Renormalization group flow
Gell-Mann & Low RG Flow

$\,$

In the previous chapter we have seen that the construction of interacting perturbative quantum field theories is given by perturbative S-matrix schemes (def. ), equivalently by time-ordered products (def. ) or equivalently by Feynman amplitudes (prop. ). These are uniquely fixed away from coinciding interaction points (prop. ) by the given local interaction (prop. ), but involve further choices of interactions whenever interaction vertices coincide (prop. ). This choice is called the choice of ("re"-)normalization (def. ) in perturbative QFT.

In this rigorous discussion no “infinite divergent quantities” (as in the original informal discussion due to Schwinger-Tomonaga-Feynman-Dyson) that need to be “re-normalized” to finite well-defined quantities are ever considered, instead finite well-defined quantities are considered right away, and the available space of choices is determined. Therefore making such choices is rather a normalization of the time-ordered products/Feynman amplitudes (as prominently highlighted in Scharf 95, see title, introduction, and section 4.3). Actual re-normalization is the the change of such normalizations.

The construction of perturbative QFTs may be explicitly described by an inductive extension of distributions of time-ordered products/Feynman amplitudes to coinciding interaction points. This is called

Epstein-Glaser renormalization.

This inductive construction has the advantage that it gives accurate control over the space of available choices of (“re”-)normalizations (theorem below) but it leaves the nature of the “new interactions” that are to be chosen at coinciding interaction points somwewhat implicit.

Alternatively, one may re-define the interactions explicitly (by adding “counterterms”, remark below), depending on a chosen UV cutoff-scale (def. below), and construct the limit as the “cutoff is removed” (prop. below). This is called (“re”-)normalization by

UV-Regularization via Counterterms.

This still leaves open the question how to choose the counterterms. For that it serves to understand the relative effective action induced by the choice of UV cutoff at any given cutoff scale (def. below). This is the perspective of effective quantum field theory (remark below).

The infinitesimal change of these relative effective actions follows a universal differential equation, known as Polchinski's flow equation (prop. below). This makes the problem of (“re”-)normalization be that of solving this differential equation subject to chosen initial data. This is the perspective on (“re”-)normalization called

Wilson-Polchinski effective QFT flow.

The main theorem of perturbative renormalization (theorem below) states that different S-matrix schemes are precisely related by vertex redefinitions. This yields the

Stückelberg-Petermann renormalization group.

If a sub-collection of renormalization schemes is parameterized by some group $RG$ , then the main theorem implies vertex redefinitions depending on pairs of elements of $RG$ (prop. below). This is known as

Renormalization group flow

Specifically scaling transformations on Minkowski spacetime yield such a collection of renormalization schemes (prop. below); the corresponding renormalization group flow is known as

Gell-Mann & Low RG flow.

The infinitesimal behaviour of this flow is known as the beta function, describing the running of the coupling constants with scale (def. below).

$\,$

Epstein-Glaser normalization

The construction of perturbative quantum field theories around a given gauge fixed relativistic free field vacuum is equivalently, by prop. , the construction of S-matrices $\mathcal{S}(g S_{int} + j A)$ in the sense of causal perturbation theory (def. ) for the given local interaction $g S_{int} + j A$ . By prop. the construction of these S-matrices is inductively in $k \in \mathbb{N}$ a choice of extension of distributions (remark and def. below) of the corresponding $k$ -ary time-ordered products of the interaction to the locus of coinciding interaction points. An inductive construction of the S-matrix this way is called Epstein-Glaser-("re"-)normalization (def. ).

By paying attention to the scaling degree (def. below) one may precisely characterize the space of choices in the extension of distributions (prop. below): For a given local interaction $g S_{int} + j A$ it is inductively in $k \in \mathbb{N}$ a finite-dimensional affine space. This conclusion is theorem below.

$\,$

Proposition

(("re"-)normalization is inductive extension of time-ordered products to diagonal)

Let $(E_{\text{BV-BRST}}, \mathbf{L}', \Delta_H )$ be a gauge-fixed relativistic free vacuum according to def. ).

\Sigma^{n+1} \setminus diag(n) \;=\; \left\{ (x_i \in \Sigma)_{i = 1}^n \;\vert\; \underset{i,j}{\exists} (x_i \neq x_j) \right\}

of the image of the diagonal inclusion $\Sigma \overset{diag}{\longrightarrow} \Sigma^{n}$ (where we regarded $T_{n+1}$ as a generalized function on $\Sigma^{n+1}$ according to remark ).

Proof

We now say this in detail:

For $I \subset \{1, \cdots, n+1\}$ write $\overline{I} \coloneqq \{1, \cdots, n+1\} \setminus I$ . For $I, \overline{I} \neq \emptyset$ , define the subset

\mathcal{C}_I \;\coloneqq\; \left\{ (x_i)_{i \in \{1, \cdots, n+1\}} \in \Sigma^{n+1} \;\vert\; \{x_i\}_{i \in I} {\vee\!\!\!\wedge} \{x_j\}_{j \in \{1, \cdots, n+1\} \setminus I} \right\} \;\subset\; \Sigma^{n+1} \,.

\underset{ { I \subset \{1, \cdots, n+1\} \atop { I, \overline{I} \neq \emptyset } } }{\cup} \mathcal{C}_I \;=\; \Sigma^{n+1} \setminus diag(\Sigma) \,.

Hence the condition of causal factorization on $T_{n+1}$ implies that restricted to any $\mathcal{C}_{I}$ these have to be given (in the condensed generalized function-notation from remark ) on any unordered tuple $\mathbf{X} = \{x_1, \cdots, x_{n+1}\} \in \mathcal{C}_I$ with corresponding induced tuples $\mathbf{I} \coloneqq \{x_i\}_{i \in I}$ and $\overline{\mathbf{I}} \coloneqq \{x_i\}_{i \in \overline{I}}$ by

(260)

T_{n+1}( \mathbf{X} ) \;=\; T(\mathbf{I}) T(\overline{\mathbf{I}}) \phantom{AA} \text{for} \phantom{A} \mathcal{X} \in \mathcal{C}_I \,.

I_1, I_2 \subset \{1, \cdots, n+1\} \,, \phantom{AA} I_1, I_2, \overline{I_1}, \overline{I_2} \neq \emptyset \,.

By definition this implies that for

\mathbf{X} \in \mathcal{C}_{I_1} \cap \mathcal{C}_{I_2}

a tuple of spacetime points which decomposes into causal order with respect to both these subsets, the corresponding mixed intersections of tuples are spacelike separated:

\mathbf{I}_1 \cap \overline{\mathbf{I}_2} \; {\gt\!\!\!\!\lt} \; \overline{\mathbf{I}_1} \cap \mathbf{I}_2 \,.

By the assumption that the $\{T_k\}_{k \neq n}$ satisfy causal factorization, this implies that the corresponding time-ordered products commute:

(261)

T(\mathbf{I}_1 \cap \overline{\mathbf{I}_2}) \, T(\overline{\mathbf{I}_1} \cap \mathbf{I}_2) \;=\; T(\overline{\mathbf{I}_1} \cap \mathbf{I}_2) \, T(\mathbf{I}_1 \cap \overline{\mathbf{I}_2}) \,.

\begin{aligned} T( \mathbf{I}_1 ) T( \overline{\mathbf{I}_1} ) & = T( \mathbf{I}_1 \cap \mathbf{I}_2 ) T( \mathbf{I}_1 \cap \overline{\mathbf{I}_2} ) \, T( \overline{\mathbf{I}_1} \cap \mathbf{I}_2 ) T( \overline{\mathbf{I}_1} \cap \overline{\mathbf{I}_2} ) \\ & = T( \mathbf{I}_1 \cap \mathbf{I}_2 ) \underbrace{ T( \overline{\mathbf{I}_1} \cap \mathbf{I}_2 ) T( \mathbf{I}_1 \cap \overline{\mathbf{I}_2} ) } T( \overline{\mathbf{I}_1} \cap \overline{\mathbf{I}_2} ) \\ & = T( \mathbf{I}_2 ) T( \overline{\mathbf{I}_2} ) \end{aligned}

Here in the first step we expanded out the two factors using (260) for $I_2$ , then under the brace we used (261) and in the last step we used again (260), but now for $I_1$ .

To conclude, let

(262)

\left( \chi_I \in C^\infty_{cp}(\Sigma^{n+1}), \, supp(\chi_I) \subset \mathcal{C}_i \right)_{ { I \subset \{1, \cdots, n+1\} } \atop { I, \overline{I} \neq \emptyset } }

be a partition of unity subordinate to the open cover formed by the $\mathcal{C}_I$ :

Then the above implies that setting for any $\mathbf{X} \in \Sigma^{n+1} \setminus diag(\Sigma)$

(263)

T_{n+1}(\mathbf{X}) \;\coloneqq\; \underset{ { I \in \{1, \cdots, n+1\} } \atop { I, \overline{I} \neq \emptyset } }{\sum} \chi_i(\mathbf{X}) T( \mathbf{I} ) T( \overline{\mathbf{I}} )

is well defined and satisfies causal factorization.

Remark

(time-ordered products of fixed interaction as distributions)

Let $(E_{\text{BV-BRST}}, \mathbf{L}', \Delta_H )$ be a gauge-fixed relativistic free vacuum according to def. , and assume that the field bundle is a trivial vector bundle (example )

and let

g S_{int} + j A \;\in\; LoObs(E_{\text{BV-BRST}})[ [ \hbar, g, j] ]\langle g,j\rangle

be a polynomial local observable as in def. , to be regarded as a adiabatically switched interaction action functional. This means that there is a finite set

\left\{ \mathbf{L}_{int,i}, \mathbf{\alpha}_{i'} \in \Omega^{p+1,0}_\Sigma(E_{\text{BV-BRST}}) \right\}_{i,i'}

of Lagrangian densities which are monomials in the field and jet coordinates, and a corresponding finite set

\left\{ g_{sw,i} \in C^\infty_{cp}(\Sigma)\langle g \rangle \,,\, j_{sw,i'} \in C^\infty_{cp}(\Sigma)\langle j \rangle \right\}

of adiabatic switchings, such that

g S_{int} + j A \;=\; \tau_{\Sigma} \left( \underset{i}{\sum} g_{sw,i} \mathbf{L}_{int,i} \;+\; \underset{i'}{\sum} j_{sw,i'} \mathbf{\alpha}_{i'} \right)

is the transgression of variational differential forms (def. ) of the sum of the products of these adiabatic switching with these Lagrangian densities.

In order to discuss the S-matrix $\mathcal{S}(g S_{int} + j A)$ and hence the time-ordered products of the special form $T_k\left( \underset{k \, \text{factors}}{\underbrace{g S_{int} + j A, \cdots, g S_{int} + j A }} \right)$ it is sufficient to restrict attention to the restriction of each $T_k$ to the subspace of local observables induced by the finite set of Lagrangian densities $\{\mathbf{L}_{int,i}, \mathbf{\alpha}_{i'}\}_{i,i'}$ .

This restriction is a continuous linear functional on the corresponding space of bump functions $\{g_{sw,i}, j_{sw,i'}\}$ , hence a dstributional section of a corresponding trivial vector bundle.

In terms of this, prop. says that the choice of time-ordered products $T_k$ is inductively in $k$ a choice of extension of distributions to the diagonal.

If $\Sigma = \mathbb{R}^{p,1}$ is Minkowski spacetime and we impose the renormalization condition “translation invariance” (def. ) then each $T_k$ is a distribution on $\Sigma^{k-1} = \mathbb{R}^{(p+1)(k-1)}$ and the extension of distributions is from the complement of the origina $0 \in \mathbb{R}^{(p+1)(k-1)}$ .

Therefore we now discuss extension of distributions (def. below) on Cartesian spaces from the complement of the origin to the origin. Since the space of choices of such extensions turns out to depend on the scaling degree of distributions, we first discuss that (def. below).

Definition

(rescaled distribution)

Let $n \in \mathbb{N}$ . For $\lambda \in (0,\infty) \subset \mathbb{R}$ a positive real number write

\array{ \mathbb{R}^n &\overset{s_\lambda}{\longrightarrow}& \mathbb{R}^n \\ x &\mapsto& \lambda x }

for the diffeomorphism given by multiplication with $\lambda$ , using the canonical real vector space-structure of $\mathbb{R}^n$ .

Then for $u \in \mathcal{D}'(\mathbb{R}^n)$ a distribution on the Cartesian space $\mathbb{R}^n$ the rescaled distribution is the pullback of $u$ along $m_\lambda$

u_\lambda \coloneqq s_\lambda^\ast u \;\in\; \mathcal{D}'(\mathbb{R}^n) \,.

Explicitly, this is given by

\array{ \mathcal{D}(\mathbb{R}^n) &\overset{ \langle u_\lambda, - \rangle}{\longrightarrow}& \mathbb{R} \\ b &\mapsto& \lambda^{-n} \langle u , b(\lambda^{-1}\cdot (-))\rangle } \,.

Similarly for $X \subset \mathbb{R}^n$ an open subset which is invariant under $s_\lambda$ , the rescaling of a distribution $u \in \mathcal{D}'(X)$ is is $u_\lambda \coloneqq s_\lambda^\ast u$ .

Definition

(scaling degree of a distribution)

Let $n \in \mathbb{N}$ and let $X \subset \mathbb{R}^n$ be an open subset of Cartesian space which is invariant under rescaling $s_\lambda$ (def. ) for all $\lambda \in (0,\infty)$ , and let $u \in \mathcal{D}'(X)$ be a distribution on this subset. Then

The scaling degree of $u$ is the infimum

$sd(u) \;\coloneqq\; inf \left\{ \omega \in \mathbb{R} \;\vert\; \underset{\lambda \to 0}{\lim} \lambda^\omega u_\lambda = 0 \right\}$

of the set of real numbers $\omega$ such that the limit of the rescaled distribution $\lambda^\omega u_\lambda$ (def. ) vanishes. If there is no such $\omega$ one sets $sd(u) \coloneqq \infty$ .
The degree of divergence of $u$ is the difference of the scaling degree by the dimension of the underlying space:

deg(u) \coloneqq sd(u) - n \,.

Example

(scaling degree of non-singular distributions)

If $u = u_f$ is a non-singular distribution given by bump function $f \in C^\infty(X) \subset \mathcal{D}'(X)$ , then its scaling degree (def. ) is non-positive

sd(u_f) \leq 0 \,.

Specifically if the first non-vanishing partial derivative $\partial_\alpha f(0)$ of $f$ at 0 occurs at order ${\vert \alpha\vert} \in \mathbb{N}$ , then the scaling degree of $u_f$ is $-{\vert \alpha\vert}$ .

Proof

By definition we have for $b \in C^\infty_{cp}(\mathbb{R}^n)$ any bump function that

\begin{aligned} \left\langle \lambda^{\omega} (u_f)_\lambda, n \right\rangle & = \lambda^{\omega-n} \underset{\mathbb{R}^n}{\int} f(x) g(\lambda^{-1} x) d^n x \\ & = \lambda^{\omega} \underset{\mathbb{R}^n}{\int} f(\lambda x) g(x) d^n x \end{aligned} \,,

where in last line we applied change of integration variables.

The limit of this expression is clearly zero for all $\omega \gt 0$ , which shows the first claim.

If moreover the first non-vanishing partial derivative of $f$ occurs at order ${\vert \alpha \vert} = k$ , then Hadamard's lemma says that $f$ is of the form

f(x) \;=\; \left( \underset{i}{\prod} \alpha_i ! \right)^{-1} (\partial_\alpha f(0)) \underset{i}{\prod} (x^i)^{\alpha_i} + \underset{ {\beta \in \mathbb{N}^n} \atop { {\vert \beta\vert} = {\vert \alpha \vert} + 1 } }{\sum} \underset{i}{\prod} (x^i)^{\beta_i} h_{\beta}(x)

where the $h_{\beta}$ are smooth functions. Hence in this case

\begin{aligned} \left\langle \lambda^{\omega} (u_f)_\lambda, n \right\rangle & = \lambda^{\omega + {\vert \alpha\vert }} \underset{\mathbb{R}^n}{\int} \left( \underset{i}{\prod} \alpha_i ! \right)^{-1} (\partial_\alpha f(0)) \underset{i}{\prod} (x^i)^{\alpha_i} b(x) d^n x \\ & \phantom{=} + \lambda^{\omega + {\vert \alpha\vert} + 1} \underset{\mathbb{R}^n}{\int} \underset{i}{\prod} (x^i)^{\beta_i} h_{\beta}(x) b(x) d^n x \end{aligned} \,.

This makes manifest that the expression goes to zero with $\lambda \to 0$ precisely for $\omega \gt - {\vert \alpha \vert}$ , which means that

sd(u_f) = -{\vert \alpha \vert}

in this case.

Example

(scaling degree of derivatives of delta-distributions)

Let $\alpha \in \mathbb{N}^n$ be a multi-index and $\partial_\alpha \delta \in \mathcal{D}'(X)$ the corresponding partial derivatives of the delta distribution $\delta_0 \in \mathcal{D}'(\mathbb{R}^n)$ supported at $0$ . Then the degree of divergence (def. ) of $\partial_\alpha \delta_0$ is the total order the derivatives

deg\left( {\, \atop \,} \partial_\alpha\delta_0{\, \atop \,} \right) \;=\; {\vert \alpha \vert}

where ${\vert \alpha\vert} \coloneqq \underset{i}{\sum} \alpha_i$ .

Proof

By definition we have for $b \in C^\infty_{cp}(\mathbb{R}^n)$ any bump function that

\begin{aligned} \left\langle \lambda^\omega (\partial_\alpha \delta_0)_\lambda, b \right\rangle & = (-1)^{{\vert \alpha \vert}} \lambda^{\omega-n} \left( \frac{ \partial^{{\vert \alpha \vert}} }{ \partial^{\alpha_1} x^1 \cdots \partial^{\alpha_n}x^n } b(\lambda^{-1}x) \right)_{\vert x = 0} \\ & = (-1)^{{\vert \alpha \vert}} \lambda^{\omega - n - {\vert \alpha\vert}} \frac{ \partial^{{\vert \alpha \vert}} }{ \partial^{\alpha_1} x^1 \cdots \partial^{\alpha_n}x^n } b(0) \end{aligned} \,,

where in the last step we used the chain rule of differentiation. It is clear that this goes to zero with $\lambda$ as long as $\omega \gt n + {\vert \alpha\vert}$ . Hence $sd(\partial_{\alpha} \delta_0) = n + {\vert \alpha \vert}$ .

Example

(scaling degree of Feynman propagator on Minkowski spacetime)

Let

\Delta_F(x) \;=\; \underset{ {\epsilon \in (0,\infty)} \atop {\epsilon \to 0} }{\lim} \frac{+i}{(2\pi)^{p+1}} \int \int_{-\infty}^\infty \frac{ e^{i k_\mu x^\mu} }{ - k_\mu k^\mu - \left( \tfrac{m c}{\hbar} \right)^2 + i \epsilon } \, d k_0 \, d^p \vec k

be the Feynman propagator for the massive free real scalar field on $n = p+1$ -dimensional Minkowski spacetime (prop. ). Its scaling degree is

\begin{aligned} sd(\Delta_{F}) & = n - 2 \\ & = p -1 \end{aligned} \,.

(Brunetti-Fredenhagen 00, example 3 on p. 22)

Proof

Regarding $\Delta_F$ as a generalized function via the given Fourier-transform expression, we find by change of integration variables in the Fourier integral that in the scaling limit the Feynman propagator becomes that for vannishing mass, which scales homogeneously:

\begin{aligned} \underset{\lambda \to 0}{\lim} \left( \lambda^\omega \; \Delta_F(\lambda x) \right) & = \underset{\lambda \to 0}{\lim} \left( \lamba^{\omega} \underset{ {\epsilon \in (0,\infty)} \atop {\epsilon \to 0} }{\lim} \frac{+i}{(2\pi)^{p+1}} \int \int_{-\infty}^\infty \frac{ e^{i k_\mu \lambda x^\mu} }{ - k_\mu k^\mu - \left( \tfrac{m c}{\hbar} \right)^2 + i \epsilon } \, d k_0 \, d^p \vec k \right) \\ & = \underset{\lambda \to 0}{\lim} \left( \lambda^{\omega-n} \; \underset{ {\epsilon \in (0,\infty)} \atop {\epsilon \to 0} }{\lim} \frac{+i}{(2\pi)^{p+1}} \int \int_{-\infty}^\infty \frac{ e^{i k_\mu \lambda x^\mu} }{ - (\lambda^{-2}) k_\mu k^\mu - \left( \tfrac{m c}{\hbar} \right)^2 + i \epsilon } \, d k_0 \, d^p \vec k \right) \\ & = \underset{\lambda \to 0}{\lim} \left( \lambda^{\omega-n + 2 } \; \underset{ {\epsilon \in (0,\infty)} \atop {\epsilon \to 0} }{\lim} \frac{+i}{(2\pi)^{p+1}} \int \int_{-\infty}^\infty \frac{ e^{i k_\mu \lambda x^\mu} }{ - k_\mu k^\mu + i \epsilon } \, d k_0 \, d^p \vec k \right) \,. \end{aligned}

Proposition

(basic properties of scaling degree of distributions)

Let $X \subset \mathbb{R}^n$ and $u \in \mathcal{D}'(X)$ be a distribution as in def. , such that its scaling degree is finite: $sd(u) \lt \infty$ (def. ). Then

For $\alpha \in \mathbb{N}^n$ , the partial derivative of distributions $\partial_\alpha$ increases scaling degree at most by ${\vert \alpha\vert }$ :

$deg(\partial_\alpha u) \;\leq\; deg(u) + {\vert \alpha\vert}$
For $\alpha \in \mathbb{N}^n$ , the product of distributions with the smooth coordinate functions $x^\alpha$ decreases scaling degree at least by ${\vert \alpha\vert }$ :

$deg(x^\alpha u) \;\leq\; deg(u) - {\vert \alpha\vert}$
Under tensor product of distributions their scaling degrees add:

$sd(u \otimes v) \leq sd(u) + sd(v)$

for $v \in \mathcal{D}'(Y)$ another distribution on $Y \subset \mathbb{R}^{n'}$ ;
$deg(f u) \leq deg(u) - k$ for $f \in C^\infty(X)$ and $f^{(\alpha)}(0) = 0$ for ${\vert \alpha\vert} \leq k-1$ ;

(Brunetti-Fredenhagen 00, lemma 5.1, Dütsch 18, exercise 3.34)

Proof

The first three statements follow with manipulations as in example and example .

For the fourth…

Proposition

(scaling degree of product distribution)

Let $u,v \in \mathcal{D}'(\mathbb{R}^n)$ be two distributions such that

both have finite degree of divergence (def. )

$deg(u), deg(v) \lt \infty$
their product of distributions is well-defined

$u v \in \mathcal{D}'(\mathbb{R}^n)$

(in that their wave front sets satisfy Hörmander's criterion)

then the product distribution has degree of divergence bounded by the sum of the separate degrees:

deg(u v) \;\leq\; deg(u) + deg(v) \,.

With the concept of scaling degree of distributions in hand, we may now discuss extension of distributions:

Definition

(extension of distributions)

Let $X \overset{\iota}{\subset} \hat X$ be an inclusion of open subsets of some Cartesian space. This induces the operation of restriction of distributions

\mathcal{D}'(\hat X) \overset{\iota^\ast}{\longrightarrow} \mathcal{D}'(X) \,.

Given a distribution $u \in \mathcal{D}'(X)$ , then an extension of $u$ to $\hat X$ is a distribution $\hat u \in \mathcal{D}'(\hat X)$ such that

\iota^\ast \hat u \;=\; u \,.

Proposition

(unique extension of distributions with negative degree of divergence)

For $n \in \mathbb{N}$ , let $u \in \mathcal{D}'(\mathbb{R}^n \setminus \{0\})$ be a distribution on the complement of the origin, with negative degree of divergence at the origin

deg(u) \lt 0 \,.

Then $u$ has a unique extension of distributions $\hat u \in \mathcal{D}'(\mathbb{R}^n)$ to the origin with the same degree of divergence

deg(\hat u) = deg(u) \,.

(Brunetti-Fredenhagen 00, theorem 5.2, Dütsch 18, theorem 3.35 a))

Proof

Regarding uniqueness:

Suppose $\hat u$ and ${\hat u}^\prime$ are two extensions of $u$ with $deg(\hat u) = deg({\hat u}^\prime)$ . Both being extensions of a distribution defined on $\mathbb{R}^n \setminus \{0\}$ , this difference has support at the origin $\{0\} \subset \mathbb{R}^n$ . By prop. this implies that it is a linear combination of derivatives of the delta distribution supported at the origin:

{\hat u}^\prime - \hat u \;=\; \underset{ {\alpha \in \mathbb{N}^n} }{\sum} c^\alpha \partial_\alpha \delta_0

for constants $c^\alpha \in \mathbb{C}$ . But by example the degree of divergence of these point-supported distributions is non-negative

deg( \partial_\alpha \delta_0) = {\vert \alpha\vert} \geq 0 \,.

This implies that $c^\alpha = 0$ for all $\alpha$ , hence that the two extensions coincide.

Regarding existence:

Let

b \in C^\infty_{cp}(\mathbb{R}^n)

be a bump function which is $\leq 1$ and constant on 1 over a neighbourhood of the origin. Write

\chi \coloneqq 1 - b \;\in\; C^\infty(\mathbb{R}^n)

graphics grabbed from Dütsch 18, p. 108

and for $\lambda \in (0,\infty)$ a positive real number, write

\chi_\lambda(x) \coloneqq \chi(\lambda x) \,.

Since the product $\chi_\lambda u$ has support of a distribution on a complement of a neighbourhood of the origin, we may extend it by zero to a distribution on all of $\mathbb{R}^n$ , which we will denote by the same symbols:

\chi_\lambda u \in \mathcal{D}'(\mathbb{R}^n) \,.

By construction $\chi_\lambda u$ coincides with $u$ away from a neighbourhood of the origin, which moreover becomes arbitrarily small as $\lambda$ increases. This means that if the following limit exists

\hat u \;\coloneqq\; \underset{\lambda \to \infty}{\lim} \chi_\lambda u

then it is an extension of $u$ .

To see that the limit exists, it is sufficient to observe that we have a Cauchy sequence, hence that for all $b\in C^\infty_{cp}(\mathbb{R}^n)$ the difference

(\chi_{n+1} u - \chi_n u)(b) \;=\; u(b)( \chi_{n+1} + \chi_n )

becomes arbitrarily small.

It remains to see that the unique extension $\hat u$ thus established has the same scaling degree as $u$ . This is shown in (Brunetti-Fredenhagen 00, p. 24).

Proposition

(space of point-extensions of distributions)

For $n \in \mathbb{N}$ , let $u \in \mathcal{D}'(\mathbb{R}^n \setminus \{0\})$ be a distribution of degree of divergence $deg(u) \lt \infty$ .

Then $u$ does admit at least one extension (def. ) to a distribution $\hat u \in \mathcal{D}'(\mathbb{R}^n)$ , and every choice of extension has the same degree of divergence as $u$

deg(\hat u) = deg(u) \,.

Moreover, any two such extensions $\hat u$ and ${\hat u}^\prime$ differ by a linear combination of partial derivatives of distributions of order $\leq deg(u)$ of the delta distribution $\delta_0$ supported at the origin:

{\hat u}^\prime - \hat u \;=\; \underset{ { \alpha \in \mathbb{N}^n } \atop { {\vert \alpha\vert} \leq deg(u) } }{\sum} q^\alpha \partial_\alpha \delta_0 \,,

for a finite number of constants $q^\alpha \in \mathbb{C}$ .

This is essentially (Hörmander 90, thm. 3.2.4). We follow (Brunetti-Fredenhagen 00, theorem 5.3), which was inspired by (Epstein-Glaser 73, section 5). Review of this approach is in (Dütsch 18, theorem 3.35 (b)), see also remark below.

Proof

For $f \in C^\infty(\mathbb{R}^n)$ a smooth function, and $\rho \in \mathbb{N}$ , we say that $f$ vanishes to order $\rho$ at the origin if all partial derivatives with multi-index $\alpha \in \mathbb{N}^n$ of total order ${\vert \alpha\vert} \leq \rho$ vanish at the origin:

\partial_\alpha f (0) = 0 \phantom{AAA} {\vert \alpha\vert} \leq \rho \,.

By Hadamard's lemma, such a function may be written in the form

(264)

f(x) \;=\; \underset{ {\alpha \in \mathbb{N}^n} \atop { {\vert \alpha \vert} = \rho + 1 } }{\sum} x^\alpha r_\alpha(x)

for smooth functions $r_\alpha \in C^\infty_{cp}(\mathbb{R}^n)$ .

Write

\mathcal{D}_\rho(\mathbb{R}^n) \hookrightarrow \mathcal{D}(\mathbb{R}^n) \coloneqq C^\infty_{cp}(\mathbb{R}^n)

for the subspace of that of all bump functions on those that vanish to order $\rho$ at the origin.

By definition this is equivalently the joint kernel of the partial derivatives of distributions of order ${\vert \alpha\vert}$ of the delta distribution $\delta_0$ supported at the origin:

b \in \mathcal{D}_\rho(\mathbb{R}^n) \phantom{AA} \Leftrightarrow \phantom{AA} \underset{ {\alpha \in \mathbb{N}^n} \atop { {\vert \alpha\vert} \leq \rho } } {\forall} \left\langle \partial_\alpha \delta_0, b \right\rangle = 0 \,.

Therefore every continuous linear projection

(265)

p_\rho \;\colon\; \mathcal{D}(\mathbb{R}^n) \longrightarrow \mathcal{D}_\rho(\mathbb{R}^n)

may be obtained from a choice of dual basis to the $\{\partial_\alpha \delta_0\}$ , hence a choice of smooth functions

\left\{ w^\beta \in C^\infty_{cp}(\mathbb{R}^n) \right\}_{ { \beta \in \mathbb{N}^n } \atop { {\vert \beta\vert} \leq \rho } }

such that

\left\langle \partial_\alpha \delta_0 \,,\, w^\beta \right\rangle \;=\; \delta_\alpha^\beta \phantom{AAA} \Leftrightarrow \phantom{AAA} \partial_\alpha w^\beta(0) \;=\; \delta_\alpha^\beta \phantom{AAAA} \text{for}\, {\vert \alpha\vert} \leq \rho \,,

by setting

(266)

p_\rho \;\coloneqq\; id \;-\; \left\langle \underset{ {\alpha \in \mathbb{N}^n} \atop { {\vert \alpha\vert} \leq \rho } }{\sum} w^\alpha \partial_\alpha \delta_0 \,,\, (-) \right\rangle \,,

hence

p_\rho \;\colon\; b \mapsto b - \underset{ {\alpha \in \mathbb{N}^n} \atop { {\vert \alpha\vert} \leq \rho } }{\sum} (-1)^{{\vert \alpha\vert}} w^\alpha \partial_\alpha b(0) \,.

Together with Hadamard's lemma in the form (264) this means that every $b \in \mathcal{D}(\mathbb{R}^n)$ is decomposed as

(267)

\begin{aligned} b(x) & = p_\rho(b)(x) \;+\; (id - p_\rho)(b)(x) \\ & = \underset{ { \alpha \in \mathbb{N}^n } \atop { {\vert \alpha \vert} = \rho + 1 } }{\sum} x^\alpha r_\alpha(x) \;+\; \underset{ { \alpha \in \mathbb{N}^n } \atop { {\vert \alpha \vert} \leq \rho } }{\sum} (-1)^{{\vert \alpha \vert}} w^\alpha \partial_\alpha b(0) \end{aligned}

Now let

\rho \;\coloneqq\; deg(u) \,.

Observe that (by prop. ) the degree of divergence of the product of distributions $x^\alpha u$ with ${\vert \alpha\vert} = \rho + 1$ is negative

\begin{aligned} deg\left( x^\alpha u \right) & = \rho - {\vert \alpha \vert} \leq -1 \end{aligned}

Therefore prop. says that each $x^\alpha u$ for ${\vert \alpha\vert} = \rho + 1$ has a unique extension $\widehat{ x^\alpha u}$ to the origin. Accordingly the composition $u \circ p_\rho$ has a unique extension, by (267):

(268)

\begin{aligned} \left\langle \hat u \,,\, b \right\rangle & = \left\langle \hat u , p_\rho(b) \right\rangle + \left\langle \hat u , (id - p_\rho)(b) \right\rangle \\ & = \underset{ { \alpha \in \mathbb{N}^n } \atop { {\vert \alpha \vert} = \rho + 1 } }{\sum} \underset{ \text{unique} }{ \underbrace{ \left\langle \widehat{x^\alpha u} \,,\, r_\alpha \right\rangle } } \;+\; \underset{ { \alpha \in \mathbb{N}^n } \atop { {\vert \alpha\vert} \leq \rho } }{\sum} \underset{ { q^\alpha } \atop { \text{choice} } }{ \underbrace{ \langle \hat u \,,\, w^\alpha \rangle } } \left\langle \partial_\alpha \delta_0 \,,\, b \right\rangle \end{aligned}

That says that $\hat u$ is of the form

\hat u \;=\; \underset{ \text{unique} }{ \underbrace{ \widehat{ u \circ p_\rho } } } + \underset{ { \alpha \in \mathbb{N}^n } \atop { {\vert \alpha\vert} \leq \rho } }{\sum} c^\alpha \, \partial_\alpha \delta_0

for a finite number of constants $c^\alpha \in \mathbb{C}$ .

Notice that for any extension $\hat u$ the exact value of the $c^\alpha$ here depends on the arbitrary choice of dual basis $\{w^\alpha\}$ used for this construction. But the uniqueness of the first summand means that for any two choices of extensions $\hat u$ and ${\hat u}^\prime$ , their difference is of the form

{\hat u}^\prime - \hat u \;=\; \underset{ { \alpha \in \mathbb{N}^n } \atop { {\vert \alpha\vert} \leq \rho } }{\sum} ( (c')^\alpha - c^\alpha ) \, \partial_\alpha \delta_0 \,,

where the constants $q^\alpha \coloneqq ( (c')^\alpha - c^\alpha ) \in \mathbb{C}$ are independent of any choices.

It remains to see that all these $\hat u$ in fact have the same degree of divergence as $u$ .

By example the degree of divergence of the point-supported distributions on the right is $deg(\partial_\alpha \delta_0) = {\vert \alpha\vert} \leq \rho$ .

Therefore to conclude it is now sufficient to show that

deg\left( \widehat{ u \circ p_\rho } \right) \;=\; \rho \,.

This is shown in (Brunetti-Fredenhagen 00, p. 25).

Remark

(“W-extensions”)

Since in Brunetti-Fredenhagen 00, (38) the projectors (266) are denoted “ $W$ ”, the construction of extensions of distributions via the proof of prop. has come to be called “W-extensions” (e.g Dütsch 18).

In conclusion we obtain the central theorem of causal perturbation theory:

Theorem

(existence and choices of ("re"-)normalization of S-matrices/perturbative QFTs)

Let $(E_{\text{BV-BRST}}, \mathbf{L}', \Delta_H )$ be a gauge-fixed relativistic free vacuum, according to def. , such that the underlying spacetime is Minkowski spacetime and the Wightman propagator $\Delta_H$ is translation-invariant.

Then:

an S-matrix scheme $\mathcal{S}$ (def. ) around this vacuum exists;
for $g S_{int} + j A \in LocObs(E_{\text{BV-BRST}})[ [ \hbar, g, j] ]\langle g,j\rangle$ a local observable as in def. , regarded as an adiabatically switched interaction action functional, the space of possible choices of S-matrices

$\mathcal{S}(g S_{int} + j A) \;\in\; PolyObs(E_{\text{BV-BRST}})_{mc}[ [ \hbar, g, j] ]$

hence of the corresponding perturbative QFTs, by prop. , is, inductively in $k \in \mathbb{N}$ , a finite dimensional affine space, parameterizing the extension of the time-ordered product $T_k$ to the locus of coinciding interaction points.

Proof

By prop. the Feynman propagator is finite scaling degree of a distribution, so that by prop. the binary time-ordered product away from the diagonal $T_2(-,-)\vert_{\Sigma^2 \setminus diag(\Sigma)} = (-) \star_{F} (-)$ has finite scaling degree.

By prop. this implies that in the inductive description of the time-ordered products by prop. , each induction step is the extension of distributions of finite scaling degree of a distribution to the point. By prop. this always exists.

This proves the first statement.

Now if a polynomial local interaction is fixed, then via remark each induction step involved extending a finite number of distributions, each of finite scaling degree. By prop. the corresponding space of choices is in each step a finite-dimensional affine space.

$\,$

Stückelberg-Petermann renormalization group

A genuine re-normalization is the passage from one S-matrix ("re"-)normalization scheme $\mathcal{S}$ to another such scheme $\mathcal{S}'$ . The inductive Epstein-Glaser ("re"-normalization) construction (prop. ) shows that the difference between any $\mathcal{S}$ and $\mathcal{S}'$ is inductively in $k \in \mathbb{N}$ a choice of extra term in the time-ordered product of $k$ factors, equivalently in the Feynman amplitudes for Feynman diagrams with $k$ vertices, that contributes when all $k$ of these vertices coincide in spacetime (prop. ).

A natural question is whether these additional interactions that appear when several interaction vertices coincide may be absorbed into a re-definition of the original interaction $g S_{int} + j A$ . Such an interaction vertex redefinition (def. below)

\mathcal{Z} \;\colon\; g S_{int} + j A \;\mapsto\; g S_{int} + j A \;+\; \text{higher order corrections}

should perturbatively send local interactions to local interactions with higher order corrections.

The main theorem of perturbative renormalization (theorem below) says that indeed under mild conditions every re-normalization $\mathcal{S} \mapsto \mathcal{S}'$ is induced by such an interaction vertex redefinition in that there exists a unique such redefinition $\mathcal{Z}$ so that for every local interaction $g S_{int} + j A$ we have that scattering amplitudes for the interaction $g S_{int} + j A$ computed with the ("re"-)normalization scheme $\mathcal{S}'$ equal those computed with $\mathcal{S}$ but applied to the re-defined interaction $\mathcal{Z}(g S_{int} + j A)$ :

\mathcal{S}' \left( {\, \atop \,} g S_{int} + j A {\, \atop \,} \right) \;=\; \mathcal{S}\left( {\, \atop \,} \mathcal{Z}(g S_{int} + j A) {\, \atop \,} \right) \,.

This means that the interaction vertex redefinitions $\mathcal{Z}$ form a group under composition which acts transitively and freely, hence regularly, on the set of S-matrix ("re"-)normalization schemes; this is called the Stückelberg-Petermann renormalization group (theorem below).

$\,$

Definition

(perturbative interaction vertex redefinition)

Let $(E_{\text{BV-BRST}}, \mathbf{L}', \Delta_H)$ be a gauge fixed free field vacuum (def. ).

A perturbative interaction vertex redefinition (or just vertex redefinition, for short) is an endofunction

\mathcal{Z} \;\colon\; LocObs(E_{\text{BV-BRST}})[ [ \hbar, g, j] ]\langle g, j\rangle \longrightarrow LocObs(E_{\text{BV-BRST}})[ [ \hbar, g, j] ]\langle g, j\rangle

on local observables with formal parameters adjoined (def. ) such that there exists a sequence $\{Z_k\}_{k \in \mathbb{N}}$ of continuous linear functionals, symmetric in their arguments, of the form

\left( {\, \atop \,} LocObs(E_{\text{BV-BRST}})[ [ \hbar, g, j] ]\langle g, j\rangle {\, \atop \,} \right)^{\otimes^k_{\mathbb{C}[ [ \hbar, g, j] ]}} \longrightarrow LocObs(E_{\text{BV-BRST}})[ [ \hbar, g, j] ]\langle g, j\rangle

such that for all $g S_{int} + j A \in LocObs(E_{\text{BV-BRST}})[ [ \hbar, g, j] ]\langle g, j\rangle$ the following conditions hold:

(perturbation)
1. $Z_0(g S_{int + j A}) = 0$
2. $Z_1(g S_{int} + j A) = g S_{int} + j A$
3. and
  
  $\begin{aligned} \mathcal{Z}(g S_{int} + j A) & = Z \exp_\otimes( g S_{int} + j A ) \\ & \coloneqq \underset{k \in \mathbb{N}}{\sum} \frac{1}{k!} Z_k( \underset{ k \, \text{args} }{ \underbrace{ g S_{int} + j A , \cdots, g S_{int} + j A } } ) \end{aligned}$
(field independence) The local observable $\mathcal{Z}(g S_{int} + j A)$ depends on the field histories only through its argument $g S_{int} + j A$ , hence by the chain rule:

(269) $\frac{\delta}{\delta \mathbf{\Phi}^a(x)} \mathcal{Z}(g S_{int} + j A) \;=\; \mathcal{Z}'_{g S_{int} + j A} \left( \frac{\delta}{\delta \mathbf{\Phi}^a(x)} (g S_{int} + j A) \right)$

The following proposition should be compared to the axiom of causal additivity of the S-matrix scheme (230):

Proposition

(local additivity of vertex redefinitions)

Let $(E_{\text{BV-BRST}}, \mathbf{L}', \Delta_H)$ be a gauge fixed free field vacuum (def. ) and let $\mathcal{Z}$ be a vertex redefinition (def. ).

Then for all local observables $O_0, O_1, O_2 \in LocObs(E_{\text{BV-BRST}})[ [ \hbar, g, j ] ]\langle g, j\rangle$ with spacetime support denoted $supp(O_i) \subset \Sigma$ (def. ) we have

(local additivity)

$\begin{aligned} & \left( supp(O_1) \cap supp(O_2) = \emptyset \right) \\ & \Rightarrow \phantom{AA} \mathcal{Z}( O_0 + O_1 + O_2) = \mathcal{Z}( O_0 + O_1 ) - \mathcal{Z}(O_0) + \mathcal{Z}(O_0 + O_2) \end{aligned} \,.$
(preservation of spacetime support)

$supp \left( {\, \atop \,} \mathcal{Z}(O_0 + O_1) - \mathcal{Z}(O_0) {\, \atop \,} \right) \;\subset\; supp(O_1)$

hence in particular

$supp \left( {\, \atop \,} \mathcal{Z}(O_1) {\, \atop \,} \right) = supp(O_1)$

(Dütsch 18, exercise 3.98)

Proof

Under the inclusion

LocObs(E_{\text{BV-BRST}}) \hookrightarrow PolyObs(E_{\text{BV-BRST}})

of local observables into polynomial observables we may think of each $Z_k$ as a generalized function, as for time-ordered products in remark .

Hence if

O_j = \underset{\Sigma}{\int} j^\infty_\Sigma( \mathbf{L}_j )

is the transgression of a Lagrangian density $\mathbf{L}$ we get

Z_k( (O_1 + O_2 + O_3) , \cdots , (O_1 + O_2 + O_3) ) = \underset{ j_1, \cdots, j_k \in \{0,1,2\} }{\sum} \underset{\Sigma^{k}}{\int} Z( \mathbf{L}_{j_1}(x_1) , \cdots , \mathbf{L}_{j_k}(x_k) ) \,.

Now by definition $Z_k(\cdots)$ is in the subspace of local observables, i.e. those polynomial observables whose coefficient distributions are supported on the diagonal, which means that

\frac{\delta}{\delta \mathbf{\Phi}^a(x)} \frac{\delta}{\delta \mathbf{\Phi}^b(y)} Z_{k}(\cdots) = 0 \phantom{AA} \text{for} \phantom{AA} x \neq y

Together with the axiom “field independence” (269) this means that the support of these generalized functions in the integrand here must be on the diagonal, where $x_1 = \cdots = x_k$ .

By the assumption that the spacetime supports of $O_1$ and $O_2$ are disjoint, this means that only the summands with $j_1, \cdots, j_k \in \{0,1\}$ and those with $j_1, \cdots, j_k \in \{0,2\}$ contribute to the above sum. Removing the overcounting of those summands where all $j_1, \cdots, j_k \in \{0\}$ we get

\begin{aligned} & Z_k\left( {\, \atop \,} (O_1 + O_2 + O_3) , \cdots , (O_1 + O_2 + O_3) {\, \atop \,} \right) \\ & = \underset{ j_1, \cdots, j_k \in \{0,1\} }{\sum} \underset{\Sigma^{k}}{\int} Z( \mathbf{L}_{j_1}(x_1) , \cdots , \mathbf{L}_{j_k}(x_k) ) \\ & \phantom{=} - \underset{ j_1, \cdots, j_k \in \{0\} }{\sum} \underset{\Sigma^{k}}{\int} Z( \mathbf{L}_{j_1}(x_1) , \cdots , \mathbf{L}_{j_k}(x_k) ) \\ & \phantom{=} - \underset{ j_1, \cdots, j_k \in \{0,2\} }{\sum} \underset{\Sigma^{k}}{\int} Z( \mathbf{L}_{j_1}(x_1) , \cdots , \mathbf{L}_{j_k}(x_k) ) \\ & = Z_k\left( {\, \atop \,} (O_0 + O_1), \cdots, (O_0 + O_1) {\, \atop \,}\right) - Z_k\left( {\, \atop \,} O_0, \cdots, O_0 {\, \atop \,} \right) + Z_k\left( {\, \atop \,} (O_0 + O_2), \cdots, (O_0 + O_2) {\, \atop \,} \right) \end{aligned} \,.

This directly implies the claim.

As a corollary we obtain:

Proposition

(composition of S-matrix scheme with vertex redefinition is again S-matrix scheme)

Let $(E_{\text{BV-BRST}}, \mathbf{L}', \Delta_H)$ be a gauge fixed free field vacuum (def. ) and let $\mathcal{Z}$ be a vertex redefinition (def. ).

Then for

\mathcal{S} \;\colon\; LocObs(E_{\text{BV-BRST}})[ [ \hbar, g, j] ]\langle g, j\rangle \longrightarrow PolyObs(E_{\text{BV-BRST}})_{mc}((\hbar))[ [ g,j ] ]

and S-matrix scheme (def. ), the composite

\mathcal{S} \circ \mathcal{Z} \;\colon\; LocObs(E_{\text{BV-BRST}})[ [ \hbar, g, j] ]\langle g, j\rangle \overset{\mathcal{Z}}{\longrightarrow} LocObs(E_{\text{BV-BRST}})[ [ \hbar, g, j] ]\langle g, j\rangle \overset{\mathcal{S}}{\longrightarrow} PolyObs(E_{\text{BV-BRST}})_{mc}((\hbar))[ [ g,j ] ]

is again an S-matrix scheme.

Moreover, if $\mathcal{S}$ satisfies the renormalization condition “field independence” (prop. ), then so does $\mathcal{S} \circ \mathcal{Z}$ .

(e.g Dütsch 18, theorem 3.99 (b))

Proof

It is clear that causal order of the spacetime supports implies that they are in particular disjoint

\left( {\, \atop \,} supp(O_1) {\vee\!\!\!\wedge} supp(O_2) {\, \atop \,} \right) \phantom{AA} \Rightarrow \phantom{AA} \left( {\, \atop \,} supp(O_1) \cap supp(O_) \;=\; \emptyset {\, \atop \,} \right)

Therefore the local additivity of $\mathcal{Z}$ (prop. ) and the causal factorization of the S-matrix (remark ) imply the causal factorization of the composite:

\begin{aligned} \mathcal{S} \left( {\, \atop \,} \mathcal{Z}(O_1 + O_2) {\, \atop \,} \right) & = \mathcal{S} \left( {\, \atop \,} \mathcal{Z}(O_1) + \mathcal{Z}(O_2) {\, \atop \,} \right) \\ & = \mathcal{S} \left( {\, \atop \,} \mathcal{Z}(O_1) {\, \atop \,} \right) \, \mathcal{S} \left( {\, \atop \,} \mathcal{Z}(O_2) {\, \atop \,} \right) \,. \end{aligned}

But by prop. this implies in turn causal additivity and hence that $\mathcal{S} \circ \mathcal{Z}$ is itself an S-matrix scheme.

Finally that $\mathcal{S} \circ \mathcal{Z}$ satisfies “field indepndence” if $\mathcal{S}$ does is immediate by the chain rule, given that $\mathcal{Z}$ satisfies this condition by definition.

Proposition

(any two S-matrix renormalization schemes differ by unique vertex redefinition)

Let $(E_{\text{BV-BRST}}, \mathbf{L}', \Delta_H)$ be a gauge fixed free field vacuum (def. ).

Then for $\mathcal{S}, \mathcal{S}'$ any two S-matrix schemes (def. ) which both satisfy the renormalization condition “field independence”, the there exists a unique vertex redefinition $\mathcal{Z}$ (def. ) relating them by composition, i. e. such that

\mathcal{S}' \;=\; \mathcal{S} \circ \mathcal{Z} \,.

Proof

By applying both sides of the equation to linear combinations of local observables of the form $\kappa_1 O_1 + \cdots + \kappa_k O_k$ and then taking derivatives with respect to $\kappa$ at $\kappa_j = 0$ (as in example ) we get that the equation in question implies

(i \hbar)^k \frac{ \partial^k }{ \partial \kappa_1 \cdots \partial \kappa_k } \mathcal{S}'( \kappa_1 O_1 + \cdots + \kappa_k O_k ) \vert_{\kappa_1, \cdots, \kappa_k = 0} \;=\; (i \hbar)^k \frac{ \partial^k }{ \partial \kappa_1 \cdots \partial \kappa_k } \mathcal{S} \circ \mathcal{Z}( \kappa_1 O_1 + \cdots + \kappa_k O_k ) \vert_{\kappa_1, \cdots, \kappa_k = 0}

which in components means that

\begin{aligned} T'_k( O_1, \cdots, O_k ) & = \underset{ 2 \leq n \leq k }{\sum} \frac{1}{n!} (i \hbar)^{k-n} \underset{ { { I_1 \sqcup \cdots \sqcup I_n } \atop { = \{1, \cdots, k\}, } } \atop { I_1, \cdots, I_n \neq \emptyset } }{\sum} T_n \left( {\, \atop \,} Z_{{\vert I_1\vert}}\left( (O_{i_1})_{i_1 \in I_1} \right), \cdots, Z_{{\vert I_n\vert}}\left( (O_{i_n})_{i_n \in I_n} \right), {\, \atop \,} \right) \\ & \phantom{=} + Z_k( O_1,\cdots, O_k ) \end{aligned}

where $\{T'_k\}_{k \in \mathbb{N}}$ are the time-ordered products corresponding to $\mathcal{S}'$ (by example ) and $\{T_k\}_{k \in \mathcal{N}}$ those correspondong to $\mathcal{S}$ .

Here the sum on the right runs over all ways that in the composite $\mathcal{S} \circ \mathcal{Z}$ a $k$ -ary operation arises as the composite of an $n$ -ary time-ordered product applied to the ${\vert I_i\vert}$ -ary components of $\mathcal{Z}$ , for $i$ running from 1 to $n$ ; except for the case $k = n$ , which is displayed separately in the second line

This shows that if $\mathcal{Z}$ exists, then it is unique, because its coefficients $Z_k$ are inductively in $k$ given by the expressions

(270)

\begin{aligned} & Z_k( O_1,\cdots, O_k ) \\ & = T'_k( O_1, \cdots, O_k ) \;-\; \underset{ (T \circ \mathcal{Z}_{\lt k})_k }{ \underbrace{ \underset{ 2 \leq n \leq k }{\sum} \frac{1}{n!} (i \hbar)^{k-n} \underset{ { { I_1 \sqcup \cdots \sqcup I_n } \atop { = \{1, \cdots, k\}, } } \atop { I_1, \cdots, I_n \neq \emptyset } }{\sum} T_n \left( Z_{{\vert I_1\vert}}( (O_{i_1})_{i_1 \in I_1} ), \cdots, Z_{{\vert I_n\vert}}( (O_{i_n})_{i_n \in I_n} ), \right) } } \end{aligned}

(The symbol under the brace is introduced as a convenient shorthand for the term above the brace.)

Hence it remains to see that the $Z_k$ defined this way satisfy the conditions in def. .

The condition “perturbation” is immediate from the corresponding condition on $\mathcal{S}$ and $\mathcal{S}'$ .

Similarly the condition “field independence” follows immediately from the assumoption that $\mathcal{S}$ and $\mathcal{S}'$ satisfy this condition.

It only remains to see that $Z_k$ indeed takes values in local observables. Given that the time-ordered products a priori take values in the larrger space of microcausal polynomial observables this means to show that the spacetime support of $Z_k$ is on the diagonal.

But observe that, as indicated in the above formula, the term over the brace may be understood as the coefficient at order $k$ of the exponential series-expansion of the composite $\mathcal{S} \circ \mathcal{Z}_{\lt k}$ , where

\mathcal{Z}_{\lt k} \;\coloneqq\; \underset{ n \in \{1, \cdots, k-1\} }{\sum} \frac{1}{n!} Z_n

is the truncation of the vertex redefinition to degree $\lt k$ . This truncation is clearly itself still a vertex redefinition (according to def. ) so that the composite $\mathcal{S} \circ \mathcal{Z}_{\lt k}$ is still an S-matrix scheme (by prop. ) so that the $(T \circ \mathcal{Z}_{\lt k})_k$ are time-ordered products (by example ).

So as we solve $\mathcal{S}' = \mathcal{S} \circ \mathcal{Z}$ inductively in degree $k$ , then for the induction step in degree $k$ the expressions $T'_{\lt k}$ and $(T \circ \mathcal{Z})_{\lt k}$ agree and are both time-ordered products. By prop. this implies that $T'_{k}$ and $(T \circ \mathcal{Z}_{\lt k})_{k}$ agree away from the diagonal. This means that their difference $Z_k$ is supported on the diagonal, and hence is indeed local.

In conclusion this establishes the following pivotal statement of perturbative quantum field theory:

Theorem

(main theorem of perturbative renormalization – Stückelberg-Petermann renormalization group of vertex redefinitions)

Let $(E_{\text{BV-BRST}}, \mathbf{L}', \Delta_H)$ be a gauge fixed free field vacuum (def. ).

the vertex redefinitions $\mathcal{Z}$ (def. ) form a group under composition;
the set of S-matrix ("re"-)normalization schemes (def. ), remark ) satisfying the renormalization condition “field independence” (prop. ) is a torsor over this group, hence equipped with a regular action in that
1. the set of S-matrix schemes is non-empty;
2. any two S-matrix ("re"-)normalization schemes $\mathcal{S}$ , $\mathcal{S}'$ are related by a unique vertex redefinition $\mathcal{Z}$ via composition:
  
  $\mathcal{S}' \;=\; \mathcal{S} \circ \mathcal{Z} \,.$

This group is called the Stückelberg-Petermann renormalization group.

Typically one imposes a set of renormalization conditions (def. ) and considers the corresponding subgroup of vertex redefinitions preserving these conditions.

Proof

The group-structure and regular action is given by prop. and prop. . The existence of S-matrices follows is the statement of Epstein-Glaser ("re"-)normalization in theorem .

$\,$

UV-Regularization via counterterms

While Epstein-Glaser renormalization (prop. ) gives a transparent picture on the space of choices in ("re"-)normalization (theorem ) the physical nature of the higher interactions that it introduces at coincident interaction points (via the extensions of distributions in prop. ) remains more implicit. But the main theorem of perturbative renormalization (theorem ), which re-expresses the difference between any two such choices as an interaction vertex redefinition, suggests that already the choice of ("re"-)normalization itself should have an incarnation in terms of interaction vertex redefinitions.

This may be realized via a construction of ("re"-)normalization in terms of UV-regularization (prop. below): For any choice of “UV-cutoff”, given by an approximation of the Feynman propagator $\Delta_F$ by non-singular distributions $\Delta_{F,\Lambda}$ (def. below) there is a unique “effective S-matrix” $\mathcal{S}_\Lambda$ induced at each cutoff scale (def. below). While the “UV-limit” $\underset{\Lambda \to \infty}{\lim} \mathcal{S}_\Lambda$ does not in general exist, it may be “regularized” by applying suitable interaction vertex redefinitions $\mathcal{Z}_\Lambda$ ; if the higher-order corrections that these introduce serve to “counter” (remark below) the coresponding UV-divergences.

This perspective of ("re"-)normalization via via counterterms is often regarded as the primary one. Its elegant proof in prop. below, however relies on the Epstein-Glaser renormalization via inductive extensions of distributions and uses the same kind of argument as in the proof of the main theorem of perturbative renormalization (theorem via prop. ) that establishes the Stückelberg-Petermann renormalization group.

$\,$

Definition

(UV cutoffs)

Let $(E_{\text{BV-BRST}}, \mathbf{L}', \Delta_H )$ be a gauge fixed relativistic free vacuum over Minkowski spacetime $\Sigma$ (according to def. ), where $\Delta_H = \tfrac{i}{2}(\Delta_+ - \Delta_-) + H$ is the corresponding Wightman propagator inducing the Feynman propagator

\Delta_F \in \Gamma'_{\Sigma \times \Sigma}(E_{\text{BV-BRST}} \boxtimes E_{\text{BV-BRST}})

by $\Delta_F = \tfrac{i}{2}(\Delta_+ + \Delta_-) + H$ .

Then a choice of UV cutoffs for perturbative QFT around this vacuum is a collection of non-singular distributions $\Delta_{F,\Lambda}$ parameterized by positive real numbers

\array{ (0, \infty) &\overset{}{\longrightarrow}& \Gamma_{\Sigma \times \Sigma,cp}(E_{\text{BV-BRST}} \boxtimes E_{\text{BV-BRST}}) \\ \Lambda &\mapsto& \Delta_{F,\Lambda} }

such that:

each $\Delta_{F,\Lambda}$ satisfies the following basic properties
1. (translation invariance)
  
  $\Delta_{F,\Lambda}(x,y) = \Delta_{F,\Lambda}(x-y)$
2. (symmetry)
  
  $\Delta^{b a}_{F,\Lambda}(y, x) \;=\; \Delta^{a b}_{F,\Lambda}(x, y)$
  
  i.e.
  
  $\Delta_{F,\Lambda}^{b a}(-x) \;=\; \Delta_{F,\Lambda}^{a b}(x)$
the $\Delta_{F,\Lambda}$ interpolate between zero and the Feynman propagator, in that, in the Hörmander topology:
1. the limit as $\Lambda \to 0$ exists and is zero
  
  $\underset{\Lambda \to \infty}{\lim} \Delta_{F,\Lambda} \;=\; 0 \,.$
2. the limit as $\Lambda \to \infty$ exists and is the Feynman propagator:
  
  $\underset{\Lambda \to \infty}{\lim} \Delta_{F,\Lambda} \;=\; \Delta_F \,.$

(Dütsch 10, section 4)

Example

(relativistic momentum cutoff)

Recall from this prop. that the Fourier transform of distributions of the Feynman propagator for the real scalar field on Minkowski spacetime $\mathbb{R}^{p,1}$ is,

\begin{aligned} \widehat{\Delta}_F(k) & = \frac{+i}{(2\pi)^{p+1}} \frac{ 1 }{ - \eta(k,k) - \left( \tfrac{m c}{\hbar} \right)^2 + i 0 } \end{aligned}

To produce a UV cutoff in the sense of def. we would like to set this function to zero for wave numbers $\vert \vec k\vert$ (hence momenta $\hbar\vert \vec k\vert$ ) larger than a given $\Lambda$ .

This needs to be done with due care: First, the Paley-Wiener-Schwartz theorem (prop. ) says that $\Delta_{F,\Lambda}$ to be a test function and hence compactly supported, its Fourier transform $\widehat{\Delta}_{F,\Lambda}$ needs to be smooth and of bounded growth. So instead of multiplying $\widehat{\Delta}_F$ by a step function in $k$ , we may multiply it with an exponential damping.

(Keller-Kopper-Schophaus 97, section 6.1, Dütsch 18, example 3.126)

Definition

(effective S-matrix scheme)

Let $(E_{\text{BV-BRST}}, \mathbf{L}', \Delta_H )$ be a gauge fixed relativistic free vacuum (according to def. ) and let $\left\{ \Delta_{F,\Lambda}\right\}_{\Lambda \in [0,\infty)}$ be a choice of UV cutoffs for perturbative QFT around this vacuum (def. ).

We say that the effective S-matrix scheme $\mathcal{S}_\Lambda$ at cutoff scale $\Lambda \in [0,\infty)$

\array{ PolyObs(E_{\text{BV-BRST}})_{mc}[ [ \hbar, g, j] ] &\overset{\mathcal{S}_{\Lambda}}{\longrightarrow}& PolyObs(E_{\text{BV-BRST}})_{mc}[ [ \hbar, g, j] ] \\ O &\mapsto& \mathcal{S}_\Lambda(O) }

is the exponential series

(271)

\begin{aligned} \mathcal{S}_\Lambda(O) & \coloneqq \exp_{F,\Lambda}\left( \frac{1}{i \hbar} O \right) \\ & = 1 + \frac{1}{i \hbar} O + \frac{1}{2} \frac{1}{(i \hbar)^2} O \star_{F,\Lambda} O + \frac{1}{3!} \frac{1}{(i \hbar)^3} O \star_{F,\Lambda} O \star_{F,\Lambda} 0 + \cdots \end{aligned} \,.

with respect to the star product $\star_{F,\Lambda}$ induced by the $\Delta_{F,\Lambda}$ (def. ).

This is evidently defined on all polynomial observables as shown, and restricts to an endomorphism on microcausal polynomial observables as shown, since the contraction coefficients $\Delta_{F,\Lambda}$ are non-singular distributions, by definition of UV cutoff.

(Dütsch 10, (4.2))

Proposition

(("re"-)normalization via UV regularization)

Let $(E_{\text{BV-BRST}}, \mathbf{L}', \Delta_H )$ be a gauge fixed relativistic free vacuum (according to def. ) and let $g S_{int} + j A \in LocObs(E_{\text{BV-BRST}})[ [ \hbar, g, j] ]\langle g,j\rangle$ a polynomial local observable as in def. , regarded as an adiabatically switched interaction action functional.

Let moreover $\{\Delta_{F,\Lambda}\}_{\Lambda \in [0,\infty)}$ be a UV cutoff (def. ); with $\mathcal{S}_\Lambda$ the induced effective S-matrix schemes (271).

Then

there exists a $[0,\infty)$ -parameterized interaction vertex redefinition $\{\mathcal{Z}_\Lambda\}_{\Lambda \in \mathbb{R}_{\geq 0}}$ (def. ) such that the limit of effective S-matrix schemes $\mathcal{S}_{\Lambda}$ (271) applied to the $\mathcal{Z}_\Lambda$ -redefined interactions

$\mathcal{S}_\infty \;\coloneqq\; \underset{\Lambda \to \infty}{\lim} \left( \mathcal{S}_\Lambda \circ \mathcal{Z}_\Lambda \right)$

exists and is a genuine S-matrix scheme around the given vacuum (def. );
every S-matrix scheme around the given vacuum arises this way.

These $\mathcal{Z}_\Lambda$ are called counterterms (remark below) and the composite $\mathcal{S}_\Lambda \circ \mathcal{Z}_\Lambda$ is called a UV regularization of the effective S-matrices $\mathcal{S}_\Lambda$ .

Hence UV-regularization via counterterms is a method of ("re"-)normalization of perturbative QFT (def. ).

This was claimed in (Brunetti-Dütsch-Fredenhagen 09, (75)), a proof was indicated in (Dütsch-Fredenhagen-Keller-Rejzner 14, theorem A.1).

Proof

Let $\{p_{\rho_{k}}\}_{k \in \mathbb{N}}$ be a sequence of projection maps as in (265) defining an Epstein-Glaser ("re"-)normalization (prop. ) of time-ordered products $\{T_k\}_{k \in \mathbb{N}}$ as extensions of distributions of the $T_k$ , regarded as distributions via remark , by the choice $q_k^\alpha = 0$ in (268).

We will construct that $\mathcal{Z}_\Lambda$ in terms of these projections $p_\rho$ .

First consider some convenient shorthand:

For $n \in \mathbb{N}$ , write $\mathcal{Z}_{\leq n} \coloneqq \underset{1 \in \{1, \cdots, n\}}{\sum} \frac{1}{n!} Z_n$ . Moreover, for $k \in \mathbb{N}$ write $(T_\Lambda \circ \mathcal{Z}_{\leq n})_k$ for the $k$ -ary coefficient in the expansion of the composite $\mathcal{S}_\Lambda \circ \mathcal{Z}_{\leq n}$ , as in equation (270) in the proof of the main theorem of perturbative renormalization (theorem , via prop. ).

In this notation we need to find $\mathcal{Z}_\Lambda$ such that for each $n \in \mathbb{N}$ we have

(272)

\underset{\Lambda \to \infty}{\lim} \left( T_\Lambda \circ \mathcal{Z}_{\leq n, \Lambda} \right)_n \;=\; T_n \,.

We proceed by induction over $n \in \mathbb{N}$ .

Since by definition $T_0 = const_1$ , $T_1 = id$ and $Z_0 = const_0$ , $Z_1 = id$ the statement is trivially true for $n = 0$ and $n = 1$ .

So assume now $n \in \mathbb{N}$ and $\{Z_{k}\}_{k \leq n}$ has been found such that (272) holds.

Observe that with the chosen renormalizing projection $p_{\rho_{n+1}}$ the time-ordered product $T_{n+1}$ may be expressed as follows:

(273)

\begin{aligned} T_{n+1}(O, \cdots, O) & = \left\langle \underset{ {\mathbf{I} \in \{1, \cdots, n+1\} } \atop { \mathbf{I}, \overline{\mathbf{I}} \neq \emptyset } }{\sum} \!\!\chi_i(\mathbf{X})\, \left( {\, \atop \,} T_{{\vert \mathbf{I}\vert}} (\mathbf{I}) {\, \atop \,} \right) \star_{F} \left( {\, \atop \,} T_{ {\vert \overline{\mathbf{I}} \vert} } ( \overline{\mathbf{I}} ) {\, \atop \,} \right) \,,\, p_{\rho_k}(O \otimes \cdots \otimes O) \right\rangle \\ & = \left\langle \underset{\Lambda \to \infty}{\lim} \underset{ {\mathbf{I} \in \{1, \cdots, n+1\} } \atop { \mathbf{I}, \overline{\mathbf{I}} \neq \emptyset } }{\sum} \!\!\chi_i(\mathbf{X})\, \left( {\, \atop \,} T_{{\vert \mathbf{I}\vert}} (\mathbf{I}) {\, \atop \,} \right) \star_{F,\Lambda} \left( {\, \atop \,} T_{ {\vert \overline{\mathbf{I}} \vert} } ( \overline{\mathbf{I}} ) {\, \atop \,} \right) \,,\, p_{\rho_k}(O \otimes \cdots \otimes O) \right\rangle \end{aligned} \,.

Here in the first step we inserted the causal decomposition (263) of $T_{n+1}$ in terms of the $\{T_k\}_{k \leq n}$ away from the diagonal, as in the proof of prop. , which is admissible because the image of $p_{\rho_{n+1}}$ vanishes on the diagonal. In the second step we replaced the star-product of the Feynman propagator $\Delta_F$ with the limit over the star-products of the regularized propagators $\Delta_{F,\Lambda}$ , which converges by the nature of the Hörmander topology (which is assumed by def. ).

Hence it is sufficient to find $Z_{n+1,\Lambda}$ and $K_{n+1,\Lambda}$ such that

(274)

\begin{aligned} \left\langle \left( T_{\Lambda} \circ \mathcal{Z}_{\Lambda} \right)_{n+1} \,,\, (-, \cdots, -) \right\rangle & = \left\langle \underset{ {\mathbf{I} \in \{1, \cdots, n+1\} } \atop { \mathbf{I}, \overline{\mathbf{I}} \neq \emptyset } }{\sum} \!\!\chi_i(\mathbf{X})\, \left( {\, \atop \,} T_{{\vert \mathbf{I}\vert}} (\mathbf{I}) {\, \atop \,} \right) \star_{F,\Lambda} \left( {\, \atop \,} T_{ {\vert \overline{\mathbf{I}} \vert} } ( \overline{\mathbf{I}} ) {\, \atop \,} \right) \,,\, p_{\rho_{k}}\left( -, \cdots, - \right) \right\rangle \\ & \phantom{=} + K_{n+1,\Lambda}(-, \cdots, -) \end{aligned}

subject to these two conditions:

$\mathcal{Z}_{n+1,\Lambda}$ is local;
$\underset{\Lambda \to \infty}{\lim} K_{n+1,\Lambda} = 0$ .

Now by expanding out the left hand side of (274) as

(T_\Lambda \circ \mathcal{Z}_\Lambda)_{n+1} \;=\; Z_{n+1,\Lambda} \;+\; (T_\Lambda \circ Z_{\leq n, \Lambda})_{n+1}

(which uses the condition $T_1 = id$ ) we find the unique solution of (274) for $Z_{n+1,\Lambda}$ , in terms of the $\{Z_{\leq n,\Lambda}\}$ and $K_{n+1,\Lambda}$ (the latter still to be chosen) to be:

(275)

\begin{aligned} \left\langle Z_{n+1,\Lambda} , (-,\cdots, -) \right\rangle & = \left\langle \underset{ {\mathbf{I} \in \{1, \cdots, n+1\} } \atop { \mathbf{I}, \overline{\mathbf{I}} \neq \emptyset } }{\sum} \!\!\chi_i(\mathbf{X})\, \left( {\, \atop \,} T_{{\vert \mathbf{I}\vert}} (\mathbf{I}) {\, \atop \,} \right) \star_{F,\Lambda} \left( {\, \atop \,} T_{ {\vert \overline{\mathbf{I}} \vert} } ( \overline{\mathbf{I}} ) {\, \atop \,} \right) \,,\, p_{\rho_{n+1}}(-, \cdots, -) \right\rangle \\ & \phantom{=} - \left\langle \left( T_{\Lambda} \circ \mathcal{Z}_{\leq n,\Lambda} \right)_{n+1} \,,\, (-, \cdots, -) \right\rangle \\ & \phantom{=} + \left\langle K_{n+1, \Lambda}, (-, \cdots, -) \right\rangle \end{aligned} \,.

We claim that the following choice works:

(276)

\begin{aligned} K_{n+1, \Lambda}(-, \cdots, -) & \coloneqq \left\langle \left( T_{\Lambda} \circ \mathcal{Z}_{\leq n, \Lambda} \right)_{n+1} \,,\, p_{\rho_{n+1}}(-, \cdots, -) \right\rangle \\ & \phantom{=} - \left\langle \underset{ {\mathbf{I} \in \{1, \cdots, n+1\} } \atop { \mathbf{I}, \overline{\mathbf{I}} \neq \emptyset } }{\sum} \!\!\chi_i(\mathbf{X})\, \left( {\, \atop \,} T_{{\vert \mathbf{I}\vert}} (\mathbf{I}) {\, \atop \,} \right) \star_{F,\Lambda} \left( {\, \atop \,} T_{ {\vert \overline{\mathbf{I}} \vert} } ( \overline{\mathbf{I}} ) {\, \atop \,} \right) \,,\, p_{\rho_{n+1}}(-, \cdots, -) \right\rangle \end{aligned} \,.

To prove this, we need to show that 1) the resulting $Z_{n+1,\Lambda}$ is local and 2) the limit of $K_{n+1,\Lambda}$ vanishes as $\Lambda \to \infty$ .

First regarding the locality of $Z_{n+1,\Lambda}$ : By inserting (276) into (275) we obtain

\begin{aligned} \left\langle Z_{n+1,\Lambda} \,,\, (-,\cdots,-) \right\rangle & = \left\langle \left( T_{\Lambda} \circ \mathcal{Z}_{\leq n} \right)_{n+1} \,,\, p(-, \cdots, -) \right\rangle - \left\langle \left( T_{\Lambda} \circ \mathcal{Z}_{\leq n} \right)_{n+1} \,,\, (-, \cdots, -) \right\rangle \\ & = \left\langle \left( T_{\Lambda} \circ \mathcal{Z}_{\leq n} \right)_{n+1} \,,\, ( p_{\rho_{n+1}} - id)(-, \cdots, -) \right\rangle \end{aligned}

By definition $p_{\rho_{n+1}} - id$ is the identity on test functions (adiabatic switchings) that vanish at the diagonal. This means that $Z_{n+1,\Lambda}$ is supported on the diagonal, and is hence local.

Second we need to show that $\underset{\Lambda \to \infty}{\lim} K_{n+1,\Lambda} = 0$ :

By applying the analogous causal decomposition (263) to the regularized products, we find

(277)

\begin{aligned} & \left\langle (T_\Lambda \circ \mathcal{Z}_{\leq n, \Lambda})_{n+1} \,,\, p_{\rho_{n+1}}(-,\cdots,-) \right\rangle \\ & = \left\langle \underset{ {\mathbf{I} \in \{1, \cdots, n+1\} } \atop { \mathbf{I}, \overline{\mathbf{I}} \neq \emptyset } }{\sum} \!\!\chi_i(\mathbf{X})\, \left( {\, \atop \,} (T_{\Lambda} \circ \mathcal{Z}_{\leq n, \Lambda})_{{\vert \mathbf{I}\vert}} (\mathbf{I}) {\, \atop \,} \right) \star_{F,\Lambda} \left( {\, \atop \,} (T_{\Lambda} \circ \mathcal{Z}_{\leq n, \Lambda})_{ {\vert \overline{\mathbf{I}} \vert} } ( \overline{\mathbf{I}} ) {\, \atop \,} \right) \,,\, p_{\rho_{n+1}}(-,\cdots,-) \right\rangle \,. \end{aligned}

Using this we compute as follows:

(278)

\begin{aligned} & \left\langle \underset{\Lambda \to \infty}{\lim} (T_\Lambda \circ \mathcal{Z}_{\leq n, \Lambda})_{n+1} \,,\, p_{\rho_{n+1}}(-,\cdots,-) \right\rangle \\ & = \left\langle \underset{\Lambda \to \infty}{\lim} \underset{ {\mathbf{I} \in \{1, \cdots, n+1\} } \atop { \mathbf{I}, \overline{\mathbf{I}} \neq \emptyset } }{\sum} \!\!\chi_i(\mathbf{X})\, \left( {\, \atop \,} (T_{\Lambda} \circ \mathcal{Z}_{\leq n, \Lambda})_{{\vert \mathbf{I}\vert}} (\mathbf{I}) {\, \atop \,} \right) \star_{F,\Lambda} \left( {\, \atop \,} (T_{\Lambda} \circ \mathcal{Z}_{\leq n, \Lambda})_{ {\vert \overline{\mathbf{I}} \vert} } ( \overline{\mathbf{I}} ) {\, \atop \,} \right) \,,\, p_{\rho_{n+1}}(-,\cdots,-) \right\rangle \\ & = \left\langle \underset{ {\mathbf{I} \in \{1, \cdots, n+1\} } \atop { I, \overline{\mathbf{I}} \neq \emptyset } }{\sum} \chi_i(\mathbf{X})\, \underset{ T_{{\vert \mathbf{I}\vert}}(\mathbf{I}) }{ \underbrace{ \left( \underset{\Lambda \to \infty}{\lim} (T_{\Lambda} \circ \mathcal{Z}_{\leq n, \Lambda})_{{\vert \mathbf{I}\vert}} (\mathbf{I}) \right) }} \left( \underset{\Lambda \to \infty}{\lim} \star_{F,\Lambda} \right) \underset{ T_{{\vert \overline{\mathbf{I}}\vert}}(\overline{\mathbf{I}}) }{ \underbrace{ \left( \underset{\Lambda \to \infty}{\lim} (T_{\Lambda} \circ \mathcal{Z}_{\leq n, \Lambda})_{ {\vert \overline{\mathbf{I}} \vert} } ( \overline{\mathbf{I}} ) \right) }} \,,\, p_{\rho_{n+1}}(-,\cdots,-) \right\rangle \\ & = \left\langle \underset{\Lambda \to \infty}{\lim} \underset{ {\mathbf{I} \in \{1, \cdots, n+1\} } \atop { I, \overline{\mathbf{I}} \neq \emptyset } }{\sum} \chi_i(\mathbf{X})\, T_{ { \vert \mathbf{I} \vert } }( \mathbf{I} ) \star_{F,\Lambda} T_{ {\vert \overline{\mathbf{I}} \vert} }( \overline{\mathbf{I}} ) \,,\, p_{\rho_{n+1}}(-,\cdots,-) \right\rangle \end{aligned} \,.

Here in the first step we inserted (277); in the second step we used that in the Hörmander topology the product of distributions preserves limits in each variable and in the third step we used the induction assumption (272) and the definition of UV cutoff (def. ).

Inserting this for the first summand in (276) shows that $\underset{\Lambda \to \infty}{\lim} K_{n+1, \Lambda} = 0$ .

In conclusion this shows that a consistent choice of counterterms $\mathcal{Z}_\Lambda$ exists to produce some S-matrix $\mathcal{S} = \underset{\Lambda \to \infty }{\lim} (\mathcal{S}_\Lambda \circ \mathcal{Z}_\Lambda)$ . It just remains to see that for every other S-matrix $\widetilde{\mathcal{S}}$ there exist counterterms $\widetilde{\mathcal{Z}}_\lambda$ such that $\widetilde{\mathcal{S}} = \underset{\Lambda \to \infty }{\lim} (\mathcal{S}_\Lambda \circ \widetilde{\mathcal{Z}}_\Lambda)$ .

But by the main theorem of perturbative renormalization (theorem ) we know that there exists a vertex redefinition $\mathcal{Z}$ such that

\begin{aligned} \widetilde{\mathcal{S}} & = \mathcal{S} \circ \mathcal{Z} \\ & = \underset{\Lambda \to \infty}{\lim} \left( \mathcal{S}_\Lambda \circ \mathcal{Z}_\Lambda \right) \circ \mathcal{Z} \\ & = \underset{\Lambda \to \infty}{\lim} ( \mathcal{S}_\Lambda \circ ( \underset{ \widetilde{\mathcal{Z}}_\Lambda }{ \underbrace{ \mathcal{Z}_\Lambda \circ \mathcal{Z} } } ) ) \end{aligned}

and hence with counterterms $\mathcal{Z}_\Lambda$ for $\mathcal{S}$ given, then counterterms for any $\widetilde{\mathcal{S}}$ are given by the composite $\widetilde{\mathcal{Z}}_\Lambda \coloneqq \mathcal{Z}_\Lambda \circ \mathcal{Z}$ .

Remark

(counterterms)

Consider

g S_{int} + j A \;\in\; LocObs(E_{\text{BV-BRST}})[ [ \hbar, g, j] ]\langle g,j\rangle

a local observable, regarded as an adiabatically switched interaction action functional.

Then prop. says that there exist vertex redefinitions of this interaction

\mathcal{Z}_\Lambda(g S_{int} + j A) \;\in\; LocObs(E_{\text{BV-BRST}})[ [ \hbar, g, j] ]\langle g,j\rangle

parameterized by $\Lambda \in [0,\infty)$ , such that the limit

\mathcal{S}_\infty(g S_{int} + j A) \;\coloneqq\; \underset{\Lambda \to \infty}{\lim} \mathcal{S}_\Lambda\left( \mathcal{Z}_\Lambda( g S_{int} + j A )\right)

exists and is an S-matrix for perturbative QFT with the given interaction $g S_{int} + j A$ .

In this case the difference

\begin{aligned} S_{counter, \Lambda} & \coloneqq \left( g S_{int} + j A \right) \;-\; \mathcal{Z}_{\Lambda}(g S_{int} + j A) \;\;\;\;\;\in\; Loc(E_{\text{BV-BRST}})[ [ \hbar, g, j] ]\langle g^2, j^2, g j\rangle \end{aligned}

(which by the axiom “perturbation” in def. is at least of second order in the coupling constant/source field, as shown) is called a choice of counterterms at cutoff scale $\Lambda$ . These are new interactions which are added to the given interaction at cutoff scale $\Lambda$

\mathcal{Z}_{\Lambda}(g S_{int} + j A) \;=\; g S_{int} + j A \;+\; S_{counter,\Lambda} \,.

In this language prop. says that for every free field vacuum and every choice of local interaction, there is a choice of counterterms to the interaction that defines a corresponding ("re"-)normalized perturbative QFT, and every (re"-)normalized perturbative QFT arises from some choice of counterterms.

$\,$

Wilson-Polchinski effective QFT flow

We have seen above that a choice of UV cutoff induces effective S-matrix schemes $\mathcal{S}_\Lambda$ at cutoff scale $\Lambda$ (def. ). To these one may associated non-local relative effective actions $S_{eff,\Lambda}$ (def. below) which are such that their effective scattering amplitudes at scale $\Lambda$ coincide with the true scattering amplitudes of a genuine local interaction as the cutoff is removed. This is the Wilsonian picture of effective quantum field theory at a given cutoff scale (remark below). Crucially the “flow” of the relative effective actions with the cutoff scale satisfies a differential equation that in itself is independent of the full UV-theory; this is Polchinski's flow equation (prop. below). Solving this equation for given choice of initial value data is hence another way of choosing ("re"-)normalization constants.

$\,$

Proposition

(effective S-matrix schemes are invertible functions)

Write

PolyObs(E_{\text{BV-BRST}})_{mc}[ [ \hbar, g, j] ]\langle g,j\rangle \hookrightarrow PolyObs(E_{\text{BV-BRST}})_{mc}[ [ \hbar, g, j] ]

for the subspace of the space of formal power series in $\hbar, g, j$ with coefficients polynomial observables on those which are at least of first order in $g,j$ , i.e. those that vanish for $g, j = 0$ (as in def. ).

Write moreover

1 + PolyObs(E_{\text{BV-BRST}})_{mc}[ [ \hbar, g, j] ]\langle g,j\rangle \hookrightarrow PolyObs(E_{\text{BV-BRST}})_{mc}[ [ \hbar, g, j] ]

for the subspace of polynomial observables which are the sum of 1 (the multiplicative unit) with an observable at least linear n $g,j$ .

Then the effective S-matrix schemes $\mathcal{S}_\Lambda$ (def. ) restrict to linear isomorphisms of the form

PolyObs(E_{\text{BV-BRST}})_{mc}[ [ \hbar, g, j] ]\langle g,j\rangle \underoverset{\simeq}{\phantom{AA}\mathcal{S}_\Lambda \phantom{AA} }{\longrightarrow} 1 + PolyObs(E_{\text{BV-BRST}})_{mc}[ [ \hbar, g, j] ]\langle g,j\rangle \,.

(Dütsch 10, (4.7))

Proof

Since each $\Delta_{F,\Lambda}$ is symmetric (def. ) if follows by general properties of star products (prop. ) just as for the genuine time-ordered product on regular polynomial observables (prop. ) that eeach the “effective time-ordered product” $\star_{F,\Lambda}$ is isomorphic to the pointwise product $(-)\cdot (-)$ (def. )

A_1 \star_{F,\Lambda} A_2 \;=\; \mathcal{T}_\Lambda \left( \mathcal{T}_\Lambda^{-1}(A_1) \cdot \mathcal{T}_\Lambda^{-1}(A_2) \right)

for

\mathcal{T}_\Lambda \;\coloneqq\; \exp \left( \tfrac{1}{2}\hbar \underset{\Sigma}{\int} \Delta_{F,\Lambda}^{a b}(x,y) \frac{\delta^2}{\delta \mathbf{\Phi}^a(x) \delta \mathbf{\Phi}^b(y)} \right)

as in (?).

In particular this means that the effective S-matrix $\mathcal{S}_\Lambda$ arises from the exponential series for the pointwise product by conjugation with $\mathcal{T}_\Lambda$ :

\mathcal{S}_\Lambda \;=\; \mathcal{T}_\Lambda \circ \exp_\cdot\left( \frac{1}{i \hbar}(-) \right) \circ \mathcal{T}_\Lambda^{-1}

(just as for the genuine S-matrix on regular polynomial observables in def. ).

Now the exponential of the pointwise product on $1 + PolyObs(E_{\text{BV-BRST}})[ [ \hbar, g, j] ]\langle g,j\rangle$ has as inverse function the natural logarithm power series, and since $\mathcal{T}$ evidently preserves powers of $g,j$ this conjugates to an inverse at each UV cutoff scale $\Lambda$ :

(279)

\mathcal{S}_\Lambda^{-1} \;=\; \mathcal{T}_\Lambda \circ \ln\left( i \hbar (-) \right) \circ \mathcal{T}_\Lambda^{-1} \,.

Definition

(relative effective action)

Consider

g S_{int} + j A \;\in\; LocObs(E_{\text{BV-BrST}})[ [ \hbar, g, j] ]\langle g, j\rangle

a local observable regarded as an adiabatically switched interaction action functional.

Then for

\Lambda,\, \Lambda_{vac} \;\in\; (0, \infty)

two UV cutoff-scale parameters, we say the relative effective action $S_{eff, \Lambda, \Lambda_0}$ is the image of this interaction under the composite of the effective S-matrix scheme $\mathcal{S}_{\Lambda_0}$ at scale $\Lambda_0$ (271) and the inverse function $\mathcal{S}_\Lambda^{-1}$ of the effective S-matrix scheme at scale $\Lambda$ (via prop. ):

(280)

S_{eff,\Lambda, \Lambda_0} \;\coloneqq\; \mathcal{S}_{\Lambda}^{-1} \circ \mathcal{S}_{\Lambda_0}(g S_{int} + j A) \phantom{AAA} \Lambda, \Lambda_0 \in [0,\infty) \,.

For chosen counterterms (remark ) hence for chosen UV regularization $\mathcal{S}_\infty$ (prop. ) this makes sense also for $\Lambda_0 = \infty$ and we write:

(281)

S_{eff,\Lambda} \;\coloneqq\; S_{eff,\Lambda, \infty} \;\coloneqq\; \mathcal{S}_{\Lambda}^{-1} \circ \mathcal{S}_{\infty}(g S_{int} + j A) \phantom{AAA} \Lambda \in [0,\infty)

(Dütsch 10, (5.4))

Remark

(effective quantum field theory)

Let $(E_{\text{BV-BRST}}, \mathbf{L}', \Delta_H )$ be a gauge fixed relativistic free vacuum (according to def. ), let $\left\{ \Delta_{F,\Lambda}\right\}_{\Lambda \in [0,\infty)}$ be a choice of UV cutoffs for perturbative QFT around this vacuum (def. ), and let $\mathcal{S}_\infty = \underset{\Lambda \to \infty}{\lim} \mathcal{S}_\Lambda \circ \mathcal{Z}_\Lambda$ be a corresponding UV regularization (prop. ).

Consider a local observable

g S_{int} + j A \;\in\; LocObs(E_{\text{BV-BrST}})[ [ \hbar, g, j] ]\langle g, j\rangle

regarded as an adiabatically switched interaction action functional.

Then def. and def. say that for any $\Lambda \in (0,\infty)$ the effective S-matrix (271) of the relative effective action (280) equals the genuine S-matrix $\mathcal{S}_\infty$ of the genuine interaction $g S_{int} + j A$ :

\mathcal{S}_\Lambda( S_{eff,\Lambda} ) \;=\; \mathcal{S}_\infty\left( g S_{int} + j A \right) \,.

In other words the relative effective action $S_{eff,\Lambda}$ encodes what the actual perturbative QFT defined by $\mathcal{S}_\infty\left( g S_{int} + j A \right)$ effectively looks like at UV cutoff $\Lambda$ .

Therefore one says that $S_{eff,\Lambda}$ defines effective quantum field theory at UV cutoff $\Lambda$ .

Notice that in general $S_{eff,\Lambda}$ is not a local interaction anymore: By prop. the image of the inverse $\mathcal{S}^{-1}_\Lambda$ of the effective S-matrix is microcausal polynomial observables in $1 + PolyObs(E_{\text{BV-BRST}})_{mc}[ [ \hbar, g, j] ]\langle g,j\rangle$ and there is no guarantee that this lands in the subspace of local observables.

Therefore effective quantum field theories at finite UV cutoff-scale $\Lambda \in [0,\infty)$ are in general not local field theories, even if their limit as $\Lambda \to \infty$ is, via prop. .

Proposition

(effective action is relative effective action at $\Lambda = 0$ )

Then the relative effective action (def. ) at $\Lambda = 0$ is the actual effective action (def. ) in the sense of the the Feynman perturbation series of Feynman amplitudes $\Gamma(g S_{int} + j A)$ (def. ) for connected Feynman diagrams $\Gamma$ :

\begin{aligned} S_{eff,0} & \coloneqq\; S_{eff,0,\infty} \\ & = S_{eff} \;\coloneqq\; \underset{\Gamma \in \Gamma_{conn}}{\sum} \Gamma(g S_{int} + j A) \,. \end{aligned}

More generally this holds true for any $\Lambda \in [0, \infty) \sqcup \{\infty\}$

\begin{aligned} S_{eff,0,\Lambda} & = \underset{\Gamma \in \Gamma_{conn}}{\sum} \Gamma_\Lambda(g S_{int} + j A) \,, \end{aligned}

where $\Gamma_\Lambda( g S_{int} + j A)$ denotes the evident version of the Feynman amplitude (def. ) with time-ordered products replaced by effective time ordered product at scale $\Lambda$ as in (def. ).

(Dütsch 18, (3.473))

Proof

Observe that the effective S-matrix scheme at scale $\Lambda = 0$ (271) is the exponential series with respect to the pointwise product (def. )

\mathcal{S}_0(O) = \exp_\cdot( O ) \,.

Therefore the statement to be proven says equivalently that the exponential series of the effective action with respect to the pointwise product is the S-matrix:

\exp_\cdot\left( \frac{1}{i \hbar} S_{eff} \right) \;=\; \mathcal{S}_\infty\left( g S_{int} + j A \right) \,.

That this is the case is the statement of prop. .

The definition of the relative effective action $\mathcal{S}_{eff,\Lambda} \coloneqq \mathcal{S}_{eff,\Lambda, \infty}$ in def. invokes a choice of UV regularization $\mathcal{S}_\infty$ (prop. ). While (by that proposition and the main theorem of perturbative renormalization, theorem )this is guaranteed to exist, in practice one is after methods for constructing this without specifying it a priori.

But the collection relative effective actions $\mathcal{S}_{eff,\Lambda, \Lambda_0}$ for $\Lambda_0 \lt \infty$ “flows” with the cutoff-parameters $\Lambda$ and in particular also with $\Lambda_0$ (remark below) which suggests that examination of this flow yields information about full theory at $\mathcal{S}_\infty$ .

This is made precise by Polchinski's flow equation (prop. below), which is the infinitesimal version of the “Wilsonian RG flow” (remark ). As a differential equation it is independent of the choice of $\mathcal{S}_{\infty}$ and hence may be used to solve for the Wilsonian RG flow without knowing $\mathcal{S}_\infty$ in advance.

The freedom in choosing the initial values of this differential equation corresponds to the ("re"-)normalization freedom in choosing the UV regularization $\mathcal{S}_\infty$ . In this sense “Wilsonian RG flow” is a method of ("re"-)normalization of perturbative QFT (def. ).

Remark

(Wilsonian groupoid of effective quantum field theories)

Then the relative effective actions $\mathcal{S}_{eff,\Lambda, \Lambda_0}$ (def. ) satisfy

S_{eff, \Lambda', \Lambda_0} \;=\; \left( \mathcal{S}_{\Lambda'}^{-1} \circ \mathcal{S}_\Lambda \right) \left( S_{eff, \Lambda, \Lambda_0} \right) \phantom{AAA} \text{for} \, \Lambda,\Lambda' \in [0,\infty) \,,\, \Lambda_0 \in [0,\infty) \sqcup \{\infty\} \,.

This is similar to a group of UV-cutoff scale-transformations. But since the composition operations are only sensible when the UV-cutoff labels match, as shown, it is really a groupoid action.

This is often called the Wilsonian RG.

We now consider the infinitesimal version of this “flow”:

Proposition

(Polchinski's flow equation)

Let $(E_{\text{BV-BRST}}, \mathbf{L}', \Delta_H )$ be a gauge fixed relativistic free vacuum (according to def. ), let $\left\{ \Delta_{F,\Lambda}\right\}_{\Lambda \in [0,\infty)}$ be a choice of UV cutoffs for perturbative QFT around this vacuum (def. ), such that $\Lambda \mapsto \Delta_{F,\Lambda}$ is differentiable.

Then for every choice of UV regularization $\mathcal{S}_\infty$ (prop. ) the corresponding relative effective actions $S_{eff,\Lambda}$ (def. ) satisfy the following differential equation:

\frac{d}{d \Lambda} S_{eff,\Lambda} \;=\; - \frac{1}{2} \frac{1}{i \hbar} \frac{d}{d \Lambda'} \left( S_{eff,\Lambda} \star_{F,\Lambda'} S_{eff,\Lambda} \right)\vert_{\Lambda' = \Lambda} \,,

where on the right we have the star product induced by $\Delta_{F,\Lambda'}$ (def. ).

This goes back to (Polchinski 84, (27)). The rigorous formulation and proof is due to (Brunetti-Dütsch-Fredenhagen 09, prop. 5.2, Dütsch 10, theorem 2).

Proof

First observe that for any polynomial observable $O \in PolyObs(E_{\text{BV-BRST}})[ [ \hbar, g, j] ]$ we have

\begin{aligned} & \frac{1}{(k+2)!} \frac{d}{d \Lambda} ( \underset{ k+2 \, \text{factors} }{ \underbrace{ O \star_{F,\Lambda} \cdots \star_{F,\Lambda} O } } ) \\ & = \frac{1}{(k+2)!} \frac{d}{d \Lambda} \left( prod \circ \exp\left( \hbar \underset{1 \leq i \lt j \leq k}{\sum} \left\langle \Delta_{F,\Lambda} , \frac{\delta}{\delta \mathbf{\Phi}_i} \frac{\delta}{\delta \mathbf{\Phi}_j} \right\rangle \right) ( \underset{ k + 2 \, \text{factors} }{ \underbrace{ O \otimes \cdots \otimes O } } ) \right) \\ & = \underset{ = \frac{1}{2} \frac{1}{k!} }{ \underbrace{ \frac{1}{(k+2)!} \left( k + 2 \atop 2 \right) }} \left( \frac{d}{d \Lambda} O \star_{F,\Lambda} O \right) \star_{F,\Lambda} \underset{ k \, \text{factors} }{ \underbrace{ O \star_{F,\Lambda} \cdots \star_{F,\Lambda} O } } \end{aligned}

Here $\frac{\delta}{\delta \mathbf{\Phi}_i}$ denotes the functional derivative of the $i$ th tensor factor of $O$ , and the binomial coefficient counts the number of ways that an unordered pair of distinct labels of tensor factors may be chosen from a total of $k+2$ tensor factors, where we use that the star product $\star_{F,\Lambda}$ is commutative (by symmetry of $\Delta_{F,\Lambda}$ ) and associative (by prop. ).

With this and the defining equality $\mathcal{S}_\Lambda(S_{eff,\Lambda}) = \mathcal{S}(g S_{int} + j A)$ (281) we compute as follows:

\begin{aligned} 0 & = \frac{d}{d \Lambda} \mathcal{S}(g S_{int} + j A) \\ & = \frac{d}{d \Lambda} \mathcal{S}_\Lambda(S_{eff,\Lambda}) \\ & = \left( \frac{1}{i \hbar} \frac{d}{d \Lambda} S_{eff,\Lambda} \right) \star_{F,\Lambda} \mathcal{S}_\Lambda(S_{eff,\Lambda}) + \left( \frac{d}{d \Lambda} \mathcal{S}_{\Lambda} \right) \left( S_{eff, \Lambda} \right) \\ & = \left( \frac{1}{i \hbar} \frac{d}{d \Lambda} S_{eff,\Lambda} \right) \star_{F,\Lambda} \mathcal{S}_\Lambda(S_{eff,\Lambda}) \;+\; \frac{1}{2} \frac{d}{d \Lambda'} \left( \frac{1}{i \hbar} S_{eff,\Lambda} \star_{F,\Lambda'} \frac{1}{i \hbar} S_{eff, \Lambda} \right) \vert_{\Lambda' = \Lambda} \star_{F,\Lambda} \mathcal{S}_\Lambda \left( S_{eff, \Lambda} \right) \end{aligned}

Acting on this equation with the multiplicative inverse $(-) \star_{F,\Lambda} \mathcal{S}_\Lambda( - S_{eff,\Lambda} )$ (using that $\star_{F,\Lambda}$ is a commutative product, so that exponentials behave as usual) this yields the claimed equation.

$\,$

renormalization group flow

In perturbative quantum field theory the construction of the scattering matrix $\mathcal{S}$ , hence of the interacting field algebra of observables for a given interaction $g S_{int}$ perturbing around a given free field vacuum, involves choices of normalization of time-ordered products/Feynman diagrams (traditionally called "re"-normalizations) encoding new interactions that appear where several of the original interaction vertices defined by $g S_{int}$ coincide.

Whenever a group $RG$ acts on the space of observables of the theory such that conjugation by this action takes ("re"-)normalization schemes into each other, then these choices of ("re"-)normalization are parameterized by – or “flow with” – the elements of $RG$ . This is called renormalization group flow (prop. below); often called RG flow, for short.

The archetypical example here is the group $RG$ of scaling transformations on Minkowski spacetime (def. below), which induces a renormalization group flow (prop. below) due to the particular nature of the Wightman propagator resp. Feynman propagator on Minkowski spacetime (example below). In this case the choice of ("re"-)normalization hence “flows with scale”.

Now the main theorem of perturbative renormalization (theorem ) states that (if only the basic renormalization condition called “field independence” is satisfied) any two choices of ("re"-)normalization schemes $\mathcal{S}$ and $\mathcal{S}'$ are related by a unique interaction vertex redefinition $\mathcal{Z}$ , as

\mathcal{S}' = \mathcal{S} \circ \mathcal{Z} \,.

Applied to a parameterization/flow of renormalization choices by a group $RG$ this hence induces an interaction vertex redefinition as a function of $RG$ . One may think of the shape of the interaction vertices as fixed and only their (adiabatically switched) coupling constants as changing under such an interaction vertex redefinition, and hence then one has coupling constants $g_j$ that are parameterized by elements $\rho$ of $RG$ :

\mathcal{Z}_{\rho_{vac}}^\rho \;\colon\; \{g_j\} \mapsto \{g_j(\rho)\}

This dependendence is called running of the coupling constants under the renormalization group flow (def. below).

One example of renormalization group flow is that induced by scaling transformations (prop. below). This is the original and main example of the concept (Gell-Mann & Low 54)

In this case the running of the coupling constants may be understood as expressing how “more” interactions (at higher energy/shorter wavelength) become visible (say to experiment) as the scale resolution is increased. In this case the dependence of the coupling $g_j(\rho)$ on the parameter $\rho$ happens to be differentiable; its logarithmic derivative (denoted “ $\psi$ ” in Gell-Mann & Low 54) is known as the beta function (Callan 70, Symanzik 70):

\beta(g) \coloneqq \rho \frac{\partial g_j}{\partial \rho} \,.

The running of the coupling constants is not quite a representation of the renormalization group flow, but it is a “twisted” representation, namely a group 1-cocycle (prop. below). For the case of scaling transformations this may be called the Gell-Mann-Low renormalization cocycle (Brunetti-Dütsch-Fredenhagen 09).

$\,$

Proposition

(renormalization group flow)

Let

vac \;\coloneqq\; (E_{\text{BV-BRST}}, \mathbf{L}', \Delta_H )

be a relativistic free vacuum (according to def. ) around which we consider interacting perturbative QFT.

Consider a group $RG$ equipped with an action on the Wick algebra of off-shell microcausal polynomial observables with formal parameters adjoined (as in def. )

rg_{(-)} \;\colon\; RG \times PolyObs(E_{\text{BV-BRST}})_{mc}((\hbar))[ [ g, j ] ] \longrightarrow PolyObs(E_{\text{BV-BRST}})_{mc}((\hbar))[ [ \hbar, g, j ] ] \,,

hence for each $\rho \in RG$ a continuous linear map $rg_\rho$ which has an inverse $rg_\rho^{-1} \in RG$ and is a homomorphism of the Wick algebra-product (the star product $\star_H$ induced by the Wightman propagator of the given vauum $vac$ )

rg_\rho( A_1 \star_H A_2 ) \;=\; rg_\rho(A_1) \star_H rg_\rho(A_2)

such that the following conditions hold:

the action preserves the subspace of off-shell polynomial local observables, hence it restricts as

$rg_{(-)} \;\colon\; RG \times LocObs(E_{\text{BV-BRST}})[ [ \hbar, g, j ] ]\langle g,j\rangle \longrightarrow LocObs(E_{\text{BV-BRST}})[ [ \hbar, g, j ] ]\langle g,j\rangle$
the action respects the causal order of the spacetime support (def. ) of local observables, in that for $O_1, O_2 \in LocObs(E_{\text{BV-BRST}})[ [ \hbar, g, j] ]$ we have

$\left( supp(O_1) \,{\vee\!\!\!\wedge}\, supp(O_2) \right) \phantom{A} \Rightarrow \phantom{A} \left( supp(rg_\rho(O_1)) \,{\vee\!\!\!\wedge}\, supp(rg_\rho(O_2)) \right)$

for all $\rho \in RG$ .

Then:

The operation of conjugation by this action on observables induces an action on the set of S-matrix renormalization schemes (def. , remark ), in that for

\mathcal{S} \;\colon\; LocObs(E_{\text{BV-BRST}})[ [ \hbar , g, j] ] \longrightarrow PolyObs(E_{\text{BV-BRST}})( (\hbar) )[ [ g, j] ]

a perturbative S-matrix scheme around the given free field vacuum $vac$ , also the composite

\mathcal{S}^\rho \;\coloneqq\; rg_\rho \circ \mathcal{S} \circ rg_{\rho}^{-1} \;\colon\; LocObs(E_{\text{BV-BRST}})[ [ \hbar , g, j] ] \longrightarrow PolyObs(E_{\text{BV-BRST}})( (\hbar) )[ [ g, j] ]

is an S-matrix scheme, for all $\rho \in RG$ .

More generally, let

vac_\rho \;\coloneqq\; (E_{\text{BV-BRST}}, \mathbf{L}'_\rho, \Delta_{H,\rho} )

be a collection of gauge fixed free field vacua parameterized by elements $\rho \in RG$ , all with the same underlying field bundle; and consider $rg_\rho$ as above, except that it is not an automorphism of any Wick algebra, but an isomorphism between the Wick algebra-structures on various vacua, in that

(282)

rg_{\rho}( A_1 \star_{H, \rho^{-1} \rho_{vac}} A_2 ) \;=\; rg_{\rho}(A_1) \star_{H, \rho_{vac}} rg_{\rho}(A_2)

for all $\rho, \rho_{vac} \in RG$

Then if

\{ \mathcal{S}_{\rho} \}_{\rho \in RG}

is a collection of S-matrix schemes, one around each of the gauge fixed free field vacua $vac_\rho$ , it follows that for all pairs of group elements $\rho_{vac}, \rho \in RG$ the composite

(283)

\mathcal{S}_{\rho_{vac}}^\rho \;\coloneqq\; rg_\rho \circ \mathcal{S}_{\rho^{-1}\rho_{vac}} \circ rg_\rho^{-1}

is an S-matrix scheme around the vacuum labeled by $\rho_{vac}$ .

Since therefore each element $\rho \in RG$ in the group $RG$ picks a different choice of normalization of the S-matrix scheme around a given vacuum at $\rho_{vac}$ , we call the assignment $\rho \mapsto \mathcal{S}_{\rho_{vac}}^{\rho}$ a re-normalization group flow.

(Brunetti-Dütsch-Fredenhagen 09, sections 4.2, 5.1, Dütsch 18, section 3.5.3)

Proof

It is clear from the definition that each $\mathcal{S}^{\rho}_{\rho_{vac}}$ satisfies the axiom “perturbation” (in def. ).

In order to verify the axiom “causal additivity”, observe, for convenience, that by prop. it is sufficient to check causal factorization.

So consider $O_1, O_2 \in LocObs(E_{\text{BV-BRST}})[ [ \hbar, g, j] ]\langle g,j\rangle$ two local observables whose spacetime support is in causal order.

supp(O_1) \;{\vee\!\!\!\wedge}\; supp(O_2) \,.

We need to show that the

\mathcal{S}_{\rho_{vac}}^{\rho}(O_1 + O_2) = \mathcal{S}_{\rho_{vac}}^\rho(O_1) \star_{H,\rho_{vac}} \mathcal{S}_{vac_e}^\rho(O_2)

for all $\rho, \rho_{vac} \in RG$ .

Using the defining properties of $rg_{(-)}$ and the causal factorization of $\mathcal{S}_{\rho^{-1}\rho_{vac}}$ we directly compute as follows:

\begin{aligned} \mathcal{S}_{\rho_{vac}}^\rho(O_1 + O_2) & = rg_\rho \circ \mathcal{S}_{\rho^{-1} \rho_{vac}} \circ rg_\rho^{-1}( O_1 + O_2 ) \\ & = rg_\rho \left( {\, \atop \,} \mathcal{S}_{\rho^{-1}\rho_{vac}} \left( rg_\rho^{-1}(O_1) + rg_\rho^{-1}(O_2) \right) {\, \atop \,} \right) \\ & = rg_\rho \left( {\, \atop \,} \left( \mathcal{S}_{\rho^{-1}\rho_{vac}}\left(rg_\rho^{-1}(O_1)\right) \right) \star_{H, \rho^{-1} \rho_{vac}} \left( \mathcal{S}_{ \rho^{-1} \rho_{vac} }\left(rg_\rho^{-1}(O_2)\right) \right) {\, \atop \,} \right) \\ & = rg_\rho \left( {\, \atop \,} \mathcal{S}_{\rho^{-1} \rho_{vac}}\left(rg_{\rho^{-1}}(O_1)\right) {\, \atop \,} \right) \star_{H, \rho_{vac}} rg_\rho \left( {\, \atop \,} \mathcal{S}_{\rho^{-1} \rho_{vac}}\left( rg_\rho^{-1}(O_2)\right) {\, \atop \,} \right) \\ & = \mathcal{S}^\rho_{\rho_{vac}}( O_1 ) \, \star_{H, \rho_{vac}} \, \mathcal{S}_{\rho_{vac}}^\rho(O_2) \,. \end{aligned}

Definition

(running coupling constants)

Let

vac \coloneqq vac_e \;\coloneqq\; (E_{\text{BV-BRST}}, \mathbf{L}', \Delta_H )

be a relativistic free vacuum (according to def. ) around which we consider interacting perturbative QFT, let $\mathcal{S}$ be an S-matrix scheme around this vacuum and let $rg_{(-)}$ be a renormalization group flow according to prop. , such that each re-normalized S-matrix scheme $\mathcal{S}_{vac}^\rho$ satisfies the renormalization condition “field independence”.

Then by the main theorem of perturbative renormalization (theorem , via prop. ) there is for every pair $\rho_1, \rho_2 \in RG$ a unique interaction vertex redefinition

\mathcal{Z}_{\rho_{vac}}^{\rho} \;\colon\; LocObs(E_{\text{BV-BRST}})[ [ \hbar, g, j] ] \longrightarrow LocObs(E_{\text{BV-BRST}})[ [ \hbar, g, j] ]

which relates the corresponding two S-matrix schemes via

(284)

\mathcal{S}_{\rho_{vac}}^{\rho} \;=\; \mathcal{S}_{\rho_{vac}} \circ \mathcal{Z}_{\rho_{vac}}^\rho \,.

If one thinks of an interaction vertex, hence a local observable $g S_{int}+ j A$ , as specified by the (adiabatically switched) coupling constants $g_j \in C^\infty_{cp}(\Sigma)\langle g \rangle$ multiplying the corresponding interaction Lagrangian densities $\mathbf{L}_{int,j} \in \Omega^{p+1,0}_\Sigma(E_{\text{BV-BRST}})$ as

g S_{int} \;=\; \underset{j}{\sum} \tau_\Sigma \left( g_j \mathbf{L}_{int,j} \right)

(where $\tau_\Sigma$ denotes transgression of variational differential forms) then $\mathcal{Z}_{\rho_1}^{\rho_2}$ exhibits a dependency of the (adiabatically switched) coupling constants $g_j$ of the renormalization group flow parameterized by $\rho$ . The corresponding functions

\mathcal{Z}_{\rho_{vac}}^{\rho}(g S_{int}) \;\colon\; (g_j) \mapsto (g_j(\rho))

are then called running coupling constants.

(Brunetti-Dütsch-Fredenhagen 09, sections 4.2, 5.1, Dütsch 18, section 3.5.3)

Proposition

(running coupling constants are group cocycle over renormalization group flow)

Consider running coupling constants

\mathcal{Z}_{\rho_{vac}}^{\rho} \;\colon\; (g_j) \mapsto (g_j(\rho))

as in def. . Then for all $\rho_{vac}, \rho_1, \rho_2 \in RG$ the following equality is satisfied by the “running functions” (284):

\mathcal{Z}_{\rho_{vac}}^{\rho_1 \rho_2} \;=\; \mathcal{Z}_{\rho_{vac}}^{\rho_1} \circ \left( \sigma_{\rho_1} \circ \mathcal{Z}_{\rho^{-1} \rho_{vac}}^{\rho_2} \circ \sigma_{\rho_1}^{-1} \right) \,.

(Brunetti-Dütsch-Fredenhagen 09 (69), Dütsch 18, (3.325))

Proof

Directly using the definitions, we compute as follows:

\begin{aligned} \mathcal{S}_{\rho_{vac}} \circ \mathcal{Z}_{\rho_{vac}}^{\rho_1 \rho_2} & = \mathcal{S}_{\rho_{vac}}^{\rho_1 \rho_2 } \\ & = \sigma_{\rho_1} \circ \underset{ = \mathcal{S}_{\rho_1^{-1} \rho_{vac}}^{\rho_2} = \mathcal{S}_{\rho_1^{-1} \rho_{vac}} \circ \mathcal{Z}_{\rho_1^{-1} \rho_vac}^{\rho_2} }{ \underbrace{ \sigma_{\rho_2} \circ \mathcal{S}_{\rho_2^{-1}\rho_1^{-1}\rho_{vac}} \circ \sigma_{\rho_2}^{-1} }} \circ \sigma_{\rho_1}^{-1} \\ & = \underset{ = \mathcal{S}_{\rho_{vac}} \circ \mathcal{Z}_{\rho_{vac}}^{\rho_1} \circ \sigma_{\rho_1} }{ \underbrace{ \sigma_{\rho_1} \circ \mathcal{S}_{\rho_1^{-1} \rho_{vac}} \circ \overset{ = id }{ \overbrace{ \sigma_{\rho_1}^{-1} \circ \sigma_{\rho_1} } } }} \circ \mathcal{Z}_{\rho_1^{-1} \rho_{vac}}^{\rho_2} \circ \sigma_{\rho_1}^{-1} \\ & = \mathcal{S}_{\rho_{vac}} \circ \mathcal{Z}_{\rho_{vac}}^{\rho_1} \circ \underbrace{ \sigma_{\rho_1} \circ \mathcal{Z}_{\rho_1^{-1} \rho_{vac}}^{\rho_2} \circ \sigma_{\rho_1}^{-1} } \end{aligned}

This demonstrates the equation between vertex redefinitions to be shown after composition with an S-matrix scheme. But by the uniqueness-clause in the main theorem of perturbative renormalization (theorem ) the composition operation $\mathcal{S}_{\rho_{vac}} \circ (-)$ as a function from vertex redefinitions to S-matrix schemes is injective. This implies the equation itself.

$\,$

Gell-Mann Low RG flow

We discuss (prop. below) that, if the field species involved have well-defined mass dimension (example below) then scaling transformations on Minkowski spacetime (example below) induce a renormalization group flow (def. ). This is the original and main example of renormalization group flows (Gell-Mann& Low 54).

Example

(scaling transformations and mass dimension)

Let

E \overset{fb}{\longrightarrow} \Sigma

be a field bundle which is a trivial vector bundle over Minkowski spacetime $\Sigma = \mathbb{R}^{p,1} \simeq_{\mathbb{R}} \mathbb{R}^{p+1}$ .

For $\rho \in (0,\infty) \subset \mathbb{R}$ a positive real number, write

\array{ \Sigma &\overset{\rho}{\longrightarrow}& \Sigma \\ x &\mapsto& \rho x }

for the operation of multiplication by $\rho$ using the real vector space-structure of the Cartesian space $\mathbb{R}^{p+1}$ underlying Minkowski spacetime.

By pullback this acts on field histories (sections of the field bundle) via

\array{ \Gamma_\Sigma(E) &\overset{\rho^\ast}{\longrightarrow}& \Gamma_\Sigma(E) \\ \Phi &\mapsto& \Phi(\rho(-)) } \,.

Let then

\rho \mapsto vac_\rho \;\coloneqq\; (E_{\text{BV-BRST}}, \mathbf{L}'_{\rho}, \Delta_{H,\rho} )

be a 1-parameter collection of relativistic free vacua on that field bundle, according to def. , and consider a decomposition into a set $Spec$ of field species (def. ) such that for each $sp \in Spec$ the collection of Feynman propagators $\Delta_{F,\rho,sp}$ for that species scales homogeneously in that there exists

dim(sp) \in \mathbb{R}

such that for all $\rho$ we have (using generalized functions-notation)

(285)

\rho^{ 2 dim(sp) } \Delta_{F, 1/\rho, sp}( \rho x ) \;=\; \Delta_{F,sp, \rho = 1}(x) \,.

Typically $\rho$ rescales a mass parameter, in which case $dim(sp)$ is also called the mass dimension of the field species $sp$ .

Let finally

\array{ PolyObs(E) & \overset{ \sigma_\rho }{\longrightarrow} & PolyObs(E) \\ \mathbf{\Phi}_{sp}^a(x) &\mapsto& \rho^{- dim(sp)} \mathbf{\Phi}^a( \rho^{-1} x ) }

be the function on off-shell polynomial observables given on field observables $\mathbf{Phi}^a(x)$ by pullback along $\rho^{-1}$ followed by multiplication by $\rho$ taken to the negative power of the mass dimension, and extended from there to all polynomial observables as an algebra homomorphism.

This constitutes an action of the group

RG \coloneqq \left( \mathbb{R}_+, \cdot \right)

of positive real numbers (under multiplication) on polynomial observables, called the group of scaling transformations for the given choice of field species and mass parameters.

(Dütsch 18, def. 3.19)

Example

(mass dimension of scalar field)

Consider the Feynman propagator $\Delta_{F,m}$ of the free real scalar field on Minkowski spacetime $\Sigma = \mathbb{R}^{p,1}$ for mass parameter $m \in (0,\infty)$ ; a Green function for the Klein-Gordon equation.

Let the group $RG \coloneqq (\mathbb{R}_+, \cdots)$ of scaling transformations $\rho \in \mathbb{R}_+$ on Minkowski spacetime (def. ) act on the mass parameter by inverse multiplication

(\rho , \Delta_{F,m}) \mapsto \Delta_{F,\rho^{-1}m}(\rho (-)) \,.

Then we have

\Delta_{F,\rho^{-1}m}(\rho (-)) \;=\; \rho^{-(p+1) + 2} \Delta_{F,1}(x)

and hence the corresponding mass dimension (def. ) of the real scalar field on $\mathbb{R}^{p,1}$ is

dim(\text{scalar field}) = (p+1)/2 - 1 \,.

Proof

By prop. the Feynman propagator in question is given by the Cauchy principal value-formula (in generalized function-notation)

\begin{aligned} \Delta_{F,m}(x) & = \underset{ {\epsilon \in (0,\infty)} \atop {\epsilon \to 0} }{\lim} \frac{+i}{(2\pi)^{p+1}} \int \int_{-\infty}^\infty \frac{ e^{i k_\mu x^\mu} }{ - k_\mu k^\mu - \left( \tfrac{m c}{\hbar} \right)^2 \pm i \epsilon } \, d k_0 \, d^p \vec k \,. \end{aligned}

By applying change of integration variables $k \mapsto \rho^{-1} k$ in the Fourier transform this becomes

\begin{aligned} \Delta_{F,\rho^{-1}m}(\rho x) & = \underset{ {\epsilon \in (0,\infty)} \atop {\epsilon \to 0} }{\lim} \frac{+i}{(2\pi)^{p+1}} \int \int_{-\infty}^\infty \frac{ e^{i k_\mu \rho x^\mu} }{ - k_\mu k^\mu - \left( \rho^{-1} \tfrac{m c}{\hbar} \right)^2 \pm i \epsilon } \, d k_0 \, d^p \vec k \\ & = \rho^{-(p+1)} \underset{ {\epsilon \in (0,\infty)} \atop {\epsilon \to 0} }{\lim} \frac{+i}{(2\pi)^{p+1}} \int \int_{-\infty}^\infty \frac{ e^{i k_\mu x^\mu} }{ - \rho^{-2} k_\mu k^\mu - \rho^{-2} \left( \tfrac{m c}{\hbar} \right)^2 \pm i \epsilon } \, d k_0 \, d^p \vec k \\ & = \rho^{-(p+1)+2} \underset{ {\epsilon \in (0,\infty)} \atop {\epsilon \to 0} }{\lim} \frac{+i}{(2\pi)^{p+1}} \int \int_{-\infty}^\infty \frac{ e^{i k_\mu x^\mu} }{ - k_\mu k^\mu - \left( \tfrac{m c}{\hbar} \right)^2 \pm i \epsilon } \, d k_0 \, d^p \vec k \\ & = \rho^{-(p+1) + 2} \Delta_{F,m}(x) \end{aligned}

Proposition

(scaling transformations are renormalization group flow)

Let

vac \coloneqq vac_m \coloneqq (E_{\text{BV-BRST}}, \mathbf{L}', \Delta_{H,m})

be a relativistic free vacua on that field bundle, according to def. equipped with a decomposition into a set $Spec$ of field species (def. ) such that for each $sp \in Spec$ the collection of Feynman propagators the corresponding field species has a well-defined mass dimension $dim(sp)$ (def. )

Then the action of the group $RG \coloneqq (\mathbb{R}_+, \cdot)$ of scaling transformations (def. ) is a renormalization group flow in the sense of prop. .

(Dütsch 18, exercise 3.20)

Proof

It is clear that rescaling preserves causal order and the renormalization condition of “field indepencen”.

The condition we need to check is that for $A_1, A_2 \in PolyObs(E_{\text{BV-BRST}})_{mc}[ [ \hbar, g, j] ]$ two microcausal polynomial observables we have for any $\rho, \rho_{vac} \in \mathbb{R}_+$ that

\sigma_\rho \left( A_1 \star_{H, \rho^{-1} \rho_{vac} c} A_2 \right) \;=\; \sigma_\rho(A_1) \star_{H,\rho_{vac}} \sigma_\rho(A_2) \,.

By the assumption of decomposition into free field species $sp \in Spec$ , it is sufficient to check this for each species $\Delta_{H,sp}$ . Moreover, by the nature of the star product on polynomial observables, which is given by iterated contractions with the Wightman propagator, it is sufficient to check this for one such contraction.

Observe that the scaling behaviour of the Wightman propagator $\Delta_{H,m}$ is the same as the behaviour (285) of the correspponding Feynman propagator. With this we directly compute as follows:

\begin{aligned} \sigma_\rho (\mathbf{\Phi}(x)) \star_{F, \rho_{vac} m} \sigma_\rho (\mathbf{\Phi}(y) & = \rho^{-2 dim } \mathbf{\Phi}(\rho^{-1} x) \star_{F, \rho_{vac} m} \mathbf{\Phi}(\rho^{-1} y) \\ & = \rho^{-2 dim } \Delta_{F, \rho_{vac} m}(\rho^{-1}(x-y)) \\ & = \Delta_{F, \rho^{-1}\rho_{vac}m }(x,y) \mathbf{1} \\ & = rg_{\rho}\left( \Delta_{F, \rho^{-1}\rho_{vac}m }(x,y) \mathbf{1} \right) \\ & = rg_{\rho} \left( \mathbf{\Phi}(x) \star_{F, \rho^{-1} \rho_{vac} m} \mathbf{\Phi}(y) \right) \end{aligned} \,.

$\,$

This concludes our discussion of renormalization.

Last revised on December 29, 2023 at 12:07:38. See the history of this page for a list of all contributions to it.